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Sir: 

r, Preben Nielsen , do hereby state and declare that 



1. 



I am a citizen of Denmark residing at Hoersholm, Denmark. 



2. I am a Science Manager at Novozymes A/S. J have a Masters of Saence degree in 
Biochemists^ from Copenhagen University (Copenhagen. Denmark) and a PhD in Biotechnology from 
Heriot Watt University (Edinburgh, UK). I have expertise in microbiology and systematic biotechnology, as 
well as in production, detection and characterization of extra-cellular enzymes. Since 1995, I have 
worked with screening and production of bacterial enzymes. 

3. I have read and am familiar with Boyer et al. (US Pat No. 4.061.541). I was requested to 
make a comparison between the alpha-amylase obtained from the bactenum NRRL 3861 . which is described 
in Boyer et al., with the alpha-amylase produced by Bacillus sp. NCIMB 40916, having an amino acki 
sequence as shown in positions 1-556 of SEQ ID N0:4 of the above^aptioned U.S. patent application. 

As discussed below, it is my conclusion that the alpha-amyiase stated by Boyer et al. to be produced 
by the bacterium NRRL 3881 would not be identical to the alpha-amylase produced by the bacterium Bacillus 
sp. NCIMB 40916 because, among other things, the bacterium NRRL 3881 is phylogenetically different from 
the bacterium Bacillus sp. NCIMB 40916. Moreover, despite following highly specific, art recognized 
procedures, we were unable to show the presence of a detectable amount of alpha-amylase activity from 
NRRL 3881. and It is my conclusion that NRRL 3881 does not produce an alpha-amylase 

4. We have perfonmed a classification of the bacterium NRRL 3881 , and based on the 16S 
rRNA gene, we concluded that NRRL 3881 is a strain of Bacillus halodurans. 
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5. We Obtained a sample of the bacterium NRRL 3881 and attempted to produce the alpha- 
amylase alleged to be produced from NRRL 3881 by U S, Patent No 4,061 ,541 in a method equivalent to 
the method described in U.S. Patent No. 4,061,541 for comparison to the alpha-amylase from Bacillus sp. 
NCIMB 40916 Specifically, we did a Phadebas assay (Pharmacia), which is standard assay known in the 
art for analyzing amylolytic activity. This assay is particularly good for alpha-amylase activity, but side 
activity will be seen from other amylolytic enzymes, such as CGTases and alpha^lycosldases. This 
assay is a very sensiHve assay, and would detect the alpha-amylase activity if it was present. 

We fermented the NRRL 3881 strain and ran the Phadebas assay twice (pH 9.5), and both times 
we were unable to produce sufficient amylase to make a proper pH profile. Based on this test, we 
concluded that NRRL 3881 does not produce an alpha-amylase. This conclusion is consistent with 
classification infomiaiion, which shows that NRRL 3881 is a Bacillus halodurans strain. Bacillus 
halodurans is reported in the literature to produce straight starch degrading enzyme activity, but from 
genome sequence it is known not to produce alpha-amylase. See Preben Nielsen et al.. "Phenetic 
diversity of alkaliphillc Bacillus strains: proposal for nine new species," Microbwlogy 141 , 1745-1761 
(1995) and Takami et al. -Complete Genome Sequence of the Alkaliphilic Bacterium Bacillus halodurans 
and Genomic Sequence Comparison with Badllus subtilis," Nucleic Acid Research, 28, 4317-4331 
(2000). 

This conclusion is also supported by the PGR reaction we performed on NRRL 3881 In particular, 
we made a PGR reaction using two degenerated primers that recognize the gene of known alpha-amylases 
from bacillus, and were unable to detect a product with DNA from NRRL 3881. 

6. The undersigned declarant declares further that all statements made herein of his own 
knowledge are true and that all statements made on information and belief are believed to be true; and 
further that these statements are made with the knowledge that willful false statements and the like so 
made are punishable by fne or imprisonment, or both, under Section 1001 of Title 18 of the United States 
Code, and that such willful false statements may jeopardize any patent issuing thereon. 

Signed this day 2S of August, 2003 



Preben Nielsen 
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One hundred and nineteen strains of alkaliphilic and allcalitolerant aerobic 
endospore^forming bacteria were examined for 47 physiological and 
J'^'^?.!^?' characters, and DMA base composition. Numerical analysis (S, and 
5,JUPGMA clustering} revealed 11 clusters that comprised three or more 
strains. Most of the phena were further characterized by analysis of 
carbohydrate utilisation profiles using th« API 50CH system, but strains of two 
taxa could not be cultured by this method. DNA reassociation studies showed 
that nine of the phena were homogeneous, but strains of phenon 4 and 
phenon 8 were each subdivided into two DNA hydridization groups The 
strains could therefore be classified Into 13 taxa plus a number of unassigned 
smgle-membered clusters. Two taxa were equated with B^diius cohnii and 
a alcaiophiius and nine of liie remainder are proposed as new species with 
the following names: A. agaradhaer^ns sp. nov. B, darkii sp. nov, R dausu sp. 
nov., B. gfbsomV sp. nov., B. haimapalus sp. nov^ B. halodurans comb, nov fi. 
horikoshii sp. nov., ft pseudalcaiophilus sp. nov, and ft pseudofirmu^ sp. nov 
Two taxa were insufficiently distinct to allow confident identification and 
these have therefore not been proposed as new species. 



Keywords: alkaUphiic, bacilh^, classification, nomcuckrurc. taxonomy 



INTRODUCTION 

During an attempt to improve the enrichment medium 
for Vibf^o chcUrae, Vedder (1934) isolated aerobic 
cnHospore-formlng bacteria from hajtian faeces, and later 
from animal faeces, which proved ro be obligatcly 
aikftUphihc organisni.s (defined as a pH optimum for 
groNvth above pH 9 and no growth at pH 7). He proposed 
the name Baaliu^ akalophUm for his strains and staied that 
he had been able to prove that life exists which not only 
tolerates, but depends on, a highly alkaline pH. Today 
these and other uikahphiUc Ba.illus strains are of con- 
siderublc industrial interest, paruculariy for rht pro- 
duction of enzymes such as proteases for inclusion in 
laundry detergents (A unstrap ,t at., 1972), xylanascs for 
use in the pulp paper industry (Nakamura ai.^ 1993) and 
cyclodextrin glu can otnmsf erase for cyclodcxtfin manu- 
facture from starch (Kitamoto ft al., 1992). These 
industrial applications have prompted vhe isokuon of 
strains from a variety of alkaline environment. 
(Honkoshi, 1991 ; Jones stai, 1994). The ;iivcrse bacteria 



tPrtsem addrtos: Mkfobiology Seaion. The Royal Veierinary and 
Agncult^iral Uo.venity, OK-19S8 Fredericklfaerg, Copenhagen. Denmark. 
Abbreviation: MUG, -l-methylumbellif^ryl ylo-glucuronide. 



recovered arc described simply as Bacillus sp. because 
there is no taxonomic framesvork to enable idcnnfi cation. 
In addition to stnct alkajjphiles, some Baallus species arc 
reported to tolerate a more or less alkaline pH and may be 
described as alkaiitolcrant (defined as growth at or above 
pH 9 hut also at pH 7). The aspoiogenuus bacterium B. 
balodtniirificam Pcnariaz al.^ 1989) is reported to grow 
between pH 5-8 and 9-6, bui has an optimum in i.he 
neuiraJ ran^e ar pH 7-4, and two thermophilic BaallH.^ 
species arc rcponcd in actually require « slighily alkaline 
pH: 'B.paiiidui' (Scholi: ctal.^ 1988), which grows best at 
pH while also tolerating pH 7; and 'B. thtrmocloacar: 
(Dcmhartcr & Hcnsel, 1989), which requires a pH of 8-9 
and does not grow at pH 7. 

Taxonomic studies of truly alkaliphiiic BacUlus sirains 
have been few. Boyer et ai (1 973) deemed two halotoJetant 
and alkaliphiiic BacUJm strams to be sufficiently related to 
thc^typc strain of i5. a/calopbilus \o justify subspecies status 
as '5. alcakphiiks subsp. baioduranx\ which necessitated 
transforming J5. aicatophilus to 'B. akakphilm subsp. akalo- 
philm\ This proposal was not included in the Approved 
Lists of Bacterial Names (Skerman tf qL, 1980). In later 
studies, strains of B. aicdophiiuj subsp. hakdt4ram' were 
shown Lo be so different from the parent species, both 
genetically and physiologically, that separate species status 
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was considered more appropriaie (Friue tt al 1990) 
Buyer tt al. (1973) considered their strains ro be distantly 
related to an alUiphUic B. circulans strain described by 
Chjslctt & Kushner (1961). This organism (stram RU 38) 
Aras later classified in a newly established aJkaliphilic 
taxon, Bizr/V/vj' foAwV (Spanka & Fritzc, 1993). 
Some indication of the diversity within alkaliphiJic W/*/ 
strains was provided by Gordon & Hyde (198'?) Thcv 
characterized 174 alkaliphjhc S^r/Z/w strains of industrial 
importance after adaptation of the strains to neutral pH 
anci allocated them to five physiologically defined groups 
Four of ihe^c groups w«c safficiently homogeneous to be 
retrieved in a later investigation with only a few com- 
positional changes (Fritze H ai, 1990). More rccendy, a 
stram of an ajkaliphilic spore-forming organism (growth 
at pH 8^.10 but not at pH 7) was described which whs so 
peculiar m its properties that the authors established a 
new genus for it: AmpbthaciUus xjUtm (Niimura tt al., 
1990). Its lack of cytochromes, quinones or catalase and 
Its ability \ o form spores under aerobic as well as anaerobic 
conditions clearly distinguished this organism from the 
genera Bacillus, ChstrtdiHrn and Sporolactohacillus. 
The aim of die present work was to classify, at the species 
level, a collection of indvstriany important alkaljphilic 
Bacillus strains. Previous Studies (Nielsen ef ai, 1994) had 
indicated thai some of these bacteria were phylo- 
genctically distinct from ail vahdly described EadUus 
species with the exceptions of ihe two alkaliphilic species 
h. akalophilus and 5. cohnii. 

METHODS 

Bacterial strains and culture conditions. The bacterial strains 
used in rhis study are Jisrcd in Tabic 1. The strains >vcrc stored 
as a working collection at 5 °C on Nuirient agir (Difco) slopes 
adju6;cd U) pH 9 with 0*1 M NaHCO^ or to pH 10 with Q-l M 
N^HCO^ and 0-1 M Na,CO, (sodium sescjuicarbgnate) and 
supplemented with O'l % MnSO^ and 0-] % MgCl^. 
Phenotypic tests. Phcnotypic te<its were performed u^inc the 
methods of Ciordor ft at. (1973), ^nth media adiusred Lo 
approximardy pH 10 by the addition of 100 ml of 1 M 
sodium sescjuiearbon.tc baffcr, except fcr the phenylalanine 
tesr, xvherc 50 mj I'^ ^va5 used (Friue a/., 1990), Tests were 
repealed where unclear Or unexpected results were obtained. 
pH range for growth. Nutrient a^^ar (1 litre) wi.? adjusted ty 
various pH values: pH 6-0 (adjusted by adding HQ). pH 7-0 
(addition of 100 m] 1-OM 5odium phospliaie buffer, pH 7 0) 
pH 8-0 (addition of 100 ml 1*0 M Na.HPO,), pH 9-0 (addiuon 
of 100 mi ] M NaHCC)^), pHlO-0 (addition of 100 ml 1 M 
sodium se,.cjuicarbonaLe buffer, pH 10) The agar media and 
huifers were autoclEvcd separately. Agar plates were inoculated 
by .-^rreaking to give discrete colonics, and growth intensity was 
interpreted on a .calc from 1 -5 after 1 , 3 and 5 d. Coding for the 
numerical analysis it^ described in the footnoie tu Table 2. 
Utilitation of carbohydrates. An e^rly stationar)- ph^se culture 
in Nutrient brc.rh (Difco) wa. centdfoged, and rhe cells 
resuspcnded ,n an equal amc^uni of resuspmsion buffer (219 e 

Mfc^SO 7H,0 ui 250 ml distHlcd water). Rcsuspended euUure 
(O-,-. ml) was added to a soft agar held Et 42 and compo.sed ol' 
^mi each of two solutions, Solution 1 was KNo f2 t?l 
N.,HPO 2H,0 (2g), NaCl (5 g). 100 ml IM Vdium 
scscjuicarbonaie buffer (pH UVO) and waier to 500 ml. After 

1746 



auroelaving, 2 mi trace clement solution lO-l t ZnSO 7H n 
0-03 g Mna,.4H,0, 0-3 g H,BO,. 0 2 g CoCl, . 6H,0 001 ^ 
CuCl 2H,0, 0-02 e NiCU 61 1.olnd 0^)3 g Na^Moa 2H O 
dissolved m I I J;^"ijcd water with 12-« g titriplex T (Merck) ."nd 
U l g FeClj . 4H,0 added], 2 ml vitamin syiutivn (0-2 me biotin 
2-0 mg nicotinic acid, 10 mg Lhianim, t-O m^> ^aminobcnzoate' 
U Smg pantothenate, 5 0 mg pyndoxamine, 2 0 mg cyano- 
cobalamin, dissolved in U)0 ml water) and 2-5 ml 2% 
MgSO^ . 7H,() were added. Solution 1 was agar (3 g) autoclavcd 
)n 500 ml distilled water. Hie APT 50CH galJcrv (Biomcueux) 
was inoculated wnh two drops of the culture using a Pasrcur 
pipertc. Growda On the various carbohydrates was examined 
after 2, 7 and 14 d as recommtrndcd by the manufacturer. 
Numerical analysis. ITie pheootypjc data, execpt carbohydrate 
utili7auon results, were coded for numerjeal analysis as binary 
characters. Carbohydrate udiization data were excluded because 
some strains failed to grow in the test system, rhus providing an 
mcompJctc data matrix. The pH range for growth was coded as 
debcribed in the footnoie to Table 2 and mol % G + C was coded 
by additive coding into six characters: > 34, > 36, > .38, >40 
> 42 and > 44 moVA G + C Data were analysed using [he 
NT5YS software package (Exeter Software). Simikriry matrices 
were calculated using the Simple Matching (J^„) and Jaccard 
(Jj) coeflTicients, and UPGMA dendrograms were drawn using 
the programs simcqual, and drawthee of NTsys. 
Mol% G + C determination. Miniprcparanons of DNA were 
prepared from cell mass (2-3 inoculation loop?) taken from a^rar 
pUtes. Cells were washed m 1 ml TES buffer f50 mM Tns 
5 mM EDTA, 2-5% sucrose, pH 8) and resuspcnded m lOO ul 
TES buffer. A fuixher 390 nl TGS containing 20 mg lyso^^ymc 
ml wij; added and the tubes incubated at 37 °C for ar least 
30 mm. Lysis buffer (300 ^d 2 mM EDTA. pH 8, containing 
fiOVo ^ruamdinium ihiocyanate and 12 5% SDS) was added 
foiiowed by 250 \i\ 7-5 M ammonium acetate. After mixmg and 
cliiUmg on ice for 10 min, the rubes were centrifuged for 15 mm 
and the supernatant removed to a new 2 ml tnicrofuge tube, The 
prcpararion was extracted with chloroform/itioamyJ alcohol, 
precipitated with 2-propan(]l, washed with 70% erhanol, dried 
and resuspcnded in RNasc buffer (Seidin & Dubnau, 1985). The 
DNA was extracted wiih phenol and then with chloroform/ 
isoamyl alcohol, precipitated m ethanoi, washed in 70 % ctlianol, 
dried find resuspcnded in water. An isoktion Lypically yielded 
approximaiely 20 jig DNA and about 90% of the RNA was 
removed. Hydrolysis and dcphosphoryiation of the DNA wai; 
carried Out as described by Mcsbah tt at. (198';). Nucleoside 
samples, 25 m1 of 100 ^ig mr\ were separated by revcrsc-pha^c 
HPIG at U^^C using 0-6 M NH,H,PO,/aceronitrile 80:6 
(v/v), pH 4-4. as solvcni and a flow rare of 0 7 ml min"^ 
(Tamaoka & Komagata, 1984). The HPLC apparatus consisted 
of a bgh-prcssuxe pump and UV detector (LKB 2l.SC and 
2151). Separation was performed ihrough a piccolumin ajid an 
inalyucal column (NucleoMl 100-5C]fl 20x4 mm and 
250 X 4 mm rcsptctjvely ; MELZ VDS, Berlin). UV absorption 
was detected and analysed by an integrator (CR^BA, Shimadzu). 
Mol% G + C was CulcuUted from deoxyguanosine and deoxy^ 
thymidine contenti; (Mesb:ih tt ai, 1 9B9). 

DNA hybridization. DNA for hybriduations was prepared 
from 300 mi cultures in Tryptonc soy broth (Difco) adjusted to 
pH 9 or 10 as described above using the method r>f Seldio & 
Dubnau (198,S), 'I'he DNA concentration was determined b) the 
diphcnyiamioc assay (Johnson, 1981) with sabnon testis DNA 
in the range 10-100 ug ml"' as sLindaid, Probe DNA wa. 
prepared by random-primcd labelling of uoial chiomosomal 
DNA with 5 ^1 ["S|dATP [600 Ci (22.2 TB<q) mmoi-\ 50 pCi 
(1-85 MHq)] using the Nonapnmcr Kit (Appligcne). Tar/jer 
DNA (UK) ng per sioi) was immobUiied on nylon mcmh^ar:'e^ 
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Taxonomy of alkaJiphUes 



TMe 1. Bacterial strains included in this study 



Sctaui detignation 
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Ha 
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C337 
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C338 
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1 112^2 
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40 






C341 
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10294 




C342 


10 


1 
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1 none 
lUiv5 




C343 


1 1 

1 1 


1 
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10296 




C 34<5 


1 7 
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10285 
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13 
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10297 




C347 
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3029B 




C348 
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10299 




C349 
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10300 




C 350 


1 1 


5* 


10301 




C351 


1 ft 
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10302 




C352 
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10303 




C353 
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10304 




C354 


Ot 


1* 


10305 




C355 




5* 


103 06 




C356 


OX 
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1 0309 


8716 


C360 


7a 
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10317 




C372 


*j 




102B1 




C3n 


OA 


4b 


10288 


8717 


C33C 


_ / 
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RAB 
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4b 
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1O307 




C357 


nii 
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1030B 
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10310 


8718 
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Dl 
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10311 
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1 nil A 
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C374 
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C410 
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5 






C411 
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1032G 




C412 
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U 




2512 


ATCC 21522, 221 


44 


5 




2513 


ATCC 21591. A-59 


45 


6* 




2514 


ATCC 21 536, 0-4 


46 


6 




2515 


ATCC 21537^ V-76 


47 


1 




2516 


ATCC 21592. A-40 


4e 


1 




zsn 


ATCC 21 593, 124-1 


49 


UC 




2516 


ATCC 21594, 169 


50 


nc 




2519 


ATCC 21595, 135 


51 


1* 




2520 


ATCC 21596, 27-1 


52 


UC 




2521 


ATCC21B32. Nl 


53 


UC 




2522 


ATCC 21833, N4 


54 


4b 




2523 


ATCC 31006. 13 



Source§ 



River baqk soil, Denmark 

FicM toil, Deamark 

Lake bank soil, HoUtein. Germany 

Infccnnn on perborarc pbie 

Clfly froiTi field, Holstcin. Germany 

Ccmcncary ioil, Dciinurk 

River baak soi). Denmark 

Field fioil, Denmark 

Garden soil, Denmark 

Garde/; soil, Denmark 

Chicken yftrd soi], Denmark 

Inftiction Ot pcrborAte plarc 

Deer manure, Dwmiark 

Chicken run soil 

Deer rnanurc, Dcflmark 

Frenb lake waicr. Dcnmai'k 

Chicken manure 

Osrrich manure. zOo 

Soil and leavc*i, Deimiark 

Chicken yard 5oU 

Chicken yard soil 

Garden bark 

Garden Soil, Denmark 

Clay from field, Holstcin, Germany 

Wood soiJ, I-Iolstcin, Germany 

IIor?c And elephant manUi'C 

Cllickcn yard soiJ 
Chiekcn ya^d soil 
Lavatory cistern 

Liquid from tannery liming baih 

Baby faccw 

Ostrich manure, 7oo 

Garden soil, Denmark 

Clay from field, Holstein, Germany 

Ostrich manure 

Biephanc iHKnure 

^X'atei from hippopotamus basin 

Tannery liming baih containers 

Tiger manure 

Pigeon manure 

Chicken yard soil 
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Table 1. (conf.) 



Source^ 



Clustcrt NCIMB DSM 



55 
50 
57 
58 
59 
60 
61 
02 
63 
64 
65 
66 
67 

69 

70 

71 

72 

73 

74 

75 

76 

77 

78 

79 

80 

81 

82 

83 

100 
101 
102 
103 
104 
10^ 
106 
107 
108 
109 
110 
111 
U2 
113 
1H 

ni 

118 
119 
120 
121 
122 
123 
\2A 
125 



4b 

6 

10^ 

7 
1^ 

7 
5 
5 

5* 
UC 

uc 

3 
7 
7 
7 
7 
7 
1 

1* 

UC 
6* 
6 
1 

7 

bb 

5 

1* 
10 

7 

UC 

10* 

10 
3 
3 

3 

2 

2 

2 

2 
11 
11 
11 
11 
11* 
UC 

8b' 

8a 

fib 

8b 

8b 

6b 

Bb* 

ba 
UC 



Othc 



2524 ATCC 3)007. 17-1 

2525 ATCC310e4, M^29 
10438 2520 NRS1554, NCTC 4554 

9218 2528 RU 38 
10280 6930 C 334 
10327 6932 C413 
10313 6939 C367 
10316 6940 C371 

6941 BC 4 

6942 BC 7 

6943 PB 9 

6944 BB 1 6 
5A 
5' A 
14B 
15B 
15' B 
AF 1 
AF2 
Br-B 
Bf'E 
Br-F 
BC3 
PB 38 
PB19 

02 
03 
M 5 
M 8 
JA 14 
JA 16 
JP395 
AC 66 
K51 
AC 13 
K320 
K 316 
AC 59 
PD 138 
H53 
DP 182 
J 102 
J 114 
1 20 
SF 16 
8723 JP170 
674 X 
jP75 
8719 JP277 
MHas96 
PD 4% 
JI'216 
BC7 



6950 
6913 
6951 
497 
7316 
7317 
7318 
7319 



t)720 



8721 



8722 



Contamination, aJkaliiic agar plate 
River bank soil, Denmark 
Clay from iidd, Holstcm, Gcnniny 
ElephaJit manure 
Blephanr manure 
K, Aunsrrup 
K, Aunsrrup 
K. AunRirup 
K. Aunstrup 

Water/sand. Wadi Narrun, Egypt 
WaLer/sand» ^idi NauLm, Egypi 
Wfltei/sand Wadi Naiiim, Egypt 
Water/Kand, Wadi Natnm, Egypt 
Water/pand, Wadi Nacrun, Egypt 
Scakctite Cflve, France 
Sralactirc cave, France 
Soil, Brazil 
Soil, Bra7il 
Soil, Brazil 

Aunstrup 
K. Aunstrup 
K- Aunstrup 

'B, a/caiopJbi/us suhi^p. halodurant* 
Owens Lake, Californja, USA 
Owens Lake, C^liTomiA, USA 
Mono Lake, California, USA 
Mono Lake, California, USA 
H. Ouiirup 
H. Ourirup 
H. Outtrup 
H- Outtnip 
H. Ouitrup 
H. Outirup 
H, OuiLiup 
H. OurLrup 
H. Ouiirup 
M. OuLtrap 
H. Outuup 
H. Outtrup 
H. Outtrup 
H- Cluttrup 
H. Outtrup 
H. Outtrup 
H. Outtrup 
H, Outtrup 
H. Outtrup 
H. Outirup 
H OuLtrup 
H. Oottnip 
Outtrup 
K. Aunsrrup 
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Table 1. (cont) 



Strain d«si^atioQ 



Sou 



References |j 



PN 



Clufiicrf NCIMB DSM 



Other 



127 
129 
130 
131 
132 
133 
135 
130 
137 
136 
139 



2 
2 
9 

10* 

9 
11 

y 

9* 

9 
10 
10 



8725 



AT 67 


H, OuTifup 


AP 72 


H. Ourtrup 


DP 155 


H. Ouiirup 


DP 45 


n. Outtrup 


DP 100 


U. Ovitrup 


J 26 


H. Ourtrup 


J 4(53 


H. Outtrup 


J 466 


H. Oucmip 


DP 44 


H. Outtrup 


DP 469 


H. Outcfup 


DP 466 


H. Outtrup 



t This stmn w»5 ,ecov«ed « a single-membsred duste, by J,/UPGMA .nalysij bur DNA hybridisation revealed that u a membsr of 

tJTliS 

SK. Aunsrrup and H, Outtrup, Novo Nordisk A/S, Bagsvacrd, Denmark. 

I 'PJ' 1' ^"'^^ ^^^^^ ^ f^^^l)' ^"^^^ (^990) , r, Gordon 6c llydc (I9b2) ; 

i v7d^, (l/^f ^' ' ^ ^' ^' ""^'^^^^^ f^^'^)' ^-1-^1^°^^ & Il^-da (1997);^, Nielsen (1994) ■ ^. Spank. ^ Fritz. (19^3) ! 



(Nytran N, Schleicher aiid Schuell) using rhe iloc-blot system. 
Hybridiziiiions Nvere carried out Qvcrnighi at 62 °C fAlcxindcr 
Be Priest, 19HV). Percentage rcassuciaTJon was; calculated ftom 
the radjoacnvity of rhc hybrids rclauvc ro homologous conirols 
and with ^vbu-aaion of unspecjfic background radiation oc- 
curring frcm iion-hdmologoas (salmon testis DNA) 
hybridirauon (Sddin & Dubnau, 1983), 

RESULTS 

Phenetic characterization 

Our inidal studies revealed that the alkaliphilic strains 
it^cluded in this study did not produce acid from glucose 
^X'hc^ examined using the classical lest procedures 
(Gordon p/^/., 1973) adapted for use at alkaline pH. Sugar 
fermentation rests were therefore excluded from rhe study. 
Moreover, soiT)e strains, notably those of cJupier 3 and 
cluster 7 (B. cobnii), failed to grow or produce a reliable 
indication of carbon urilization in the modified API 50CH 
rests ; since these data were incomplete they were excluded 
from the numerical analysis. The numerical dassiiicaiion 
was therefore based on 47 charaacrs including DNA base 
compobition, which was coded as six binary characters 'as 
described in Methods. Most of these tests had previously 
been shown ro be useful for classification of alkahphilic 
Baciiikj strains (Fiitze d aL, 1990). 

Despite the relaavely few tests, the 119 test strains were 
consistently allocated to 11 cluster-? ui analyses using the 
Jaccard (at the 80 % similarity level) and simple matching 
(at the 90% similarity level) coefficients wjth avera|,'c 
Unkage (UPGMA) cluster analysis, The only diH'crences 
between the iwrj classifications were that strain PN-10 
clusiered wiih phcnon 1 and strain PN-82 with phenon 10 



in the Jj dendrogram but both were recovered as sint^le- 
mcmbered clusters by S^y^ analysis. Subsequent evaluation 
of these results by DNA hybridization revealed that the .Vj 
allocation was correct for strain PN-10 (DNA could not 
be prepared from strain PN-S2) and therefore this 
classification is shown in Fig. 1. Nine strains were 
recovered as single-membered dusters and two strains 
(PN-64 and PN-125) as ^ doublet. Biochemical and 
physiological characteristics of dusters 1-11 arc shown in 
Table 2. 

Determination of carbohydrate utilization required de- 
velopment of a minimal medium. Ammonium was 
ineffective as a nitrogen source, probably because am^ 
moma gas was released at tlie hi^h pH, and therefore 
nitrate was used in a minimal medium (sec Methods) al 
pH 10. The patterns of carbon source utilisation are 
shown in Table 3. Utilization of aesculin could not be 
detected because the substrate wa.s unstable under the 
alkaline conditions. Eight carbohydrate substrates were 
cacaboiizcd by essentially all strains tested. These were: 
cellobiosc, fructose, glucose, glycerol (with the exception 
of group 8a), maltose, mannitol, sucrose and trehalose. 
Conversely, no growth was obtained for any strains on 
the following substrates: adonitol, arbutin, L-arabitol, n- 
hicose, L-fucoSe, inuJin, 5-ketogluconatc» sorbose and l- 
xylosc, The percentage distribution of positive characters 
for the taxa for the remaining 31 substrates are listed in 
Table 3. 

The clusters derived by phenotypic analysis were, wirh 
the exception of cohmi (phcnon 7), which had been 
previously studied by Spanka & Priti;e (1^>93), evaluated 
by DNA hybndiv:ation. The Same reference strains were 
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Clualar 
Numbsr 



6 
9 

1 0 



a, fi, 7 3, 10. 11, 
12, 13, 14. 15, 

16, 10, ig, 21, 
27. 47, 48, 51, 
53, r?„ 73, 77, 
B1 

IM, 106, 107, 
100, 122,^ 

102 103, 1CW 
1. 2- 5, 8. 26, 

17, 22. 38. 29, 
3J2. 31, 32, 36. 
37. 38, 39, 40. 
41. iZ, 4i 61. 
62, S3, ee. BO 



ia. 24. 35, 
43, 45, 46. 
56, 75, 7€ 

4. 34, 58, 60, 

$7, 66, 70, 
7t. ?e, 63 

2S. 117, iia, 

119, lao, 121 
152, 123.124 

130. 132. 135, 
136, laz 

131, 13B, 139, 
ez, 100, 101 



j-Dfl , 110, 111, 
11Z, 113, 133 

50, a^sx 



idonlily / 
propQSBd Hums 

0. psauOotirmus 



B. agnradhBerBns 
B. darkil 



B. dausH 



B. cohnii 



9. halmapalus 
S harikoshii 



B. paouaottlcaiophffus 
S. (Ucafophitus 

B. gibsonti 



40 50 50 70 80 90 
Percentage similarity 



h^'J^J^r^ tnT^^^^^ -S"^'"^ allocation of strains to clusters based on the 5j/UPGMA analysis of the 

™? characteristics. Bold underlined strain number, indicate those stra.ns for which the full I65 

^wQM^ « ' u (Nielsen et a/., 1994); underlined strain numbers indicate those for which a partial 

165 rRNA sequence has been determined (unpublished). 



used as in the previous rRNA sequence analysis study of 
these bacrerja (Nielsen ef a/., 1994). The rRNA study had 
indicated that dusters 4 and 8 may be heterogeneous and 
therefore ru>o reference strains vcrc chosen for each of 
diesc clusters (sec Table 4). In general, the classi£cauons 
based on phenctic ajialysis and DNA hybridization were 
coniisccnL, although there were some discrepancies, as 
described below and in the descriptions of phena. 
DNA from none of the reference strains hybridized 
significantly with DNA from any other reference strain, 
thus bupportin^^ the mtegrjt)' of the 11 clu.^tcrs- Clusters 4 
and 8 were each shown to represent at Icasr two DNA 
hybridizauon groups, wliich arc consequently labelled h 



and b in Tables 1, 2 and 3. DNA from only one of the 
single-mcmbered clusters hybridized significantly with 
DNA from any of the reference strains (datA not shown), 
supporting the recovery of these Strain? outside the 11 
cluftcri; ID ihe numerical analysis. The exccpuon was 
strain PN-57^ which showed high binding to DNA from 
B. slcalopbilks DSM 485'^ and was thus considered as ft 
member of cluster 10. 

Analysis and descriptions of the groupings of the 
alkaliphilic Badllus strains 

Phenon 1. This large group of 23 strains, derived from 
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Table 2. Biochemical and physiological characteristics of clusters shown in the S,/upgma 
analysis modrf.ed according to the DNA hybridization resulu ^/u^-^^ma 



t-lltstrr .. 


1 


2 


3 




1 Q 

Is 


4 


3 


Hydrolysis oO 








Hippuratc 


0 


0 


lOO 


' MUG 


0 


0 


0 


Pullulan 


11 


50 


0 


Svarch 


100 


100 


0 


Twcen 20 


0 


0 


0 


Twcen 40 


100 


100 


100 


Twccfl 60 


too 


100 


100 


Dcamination of: 








PhcaylaJajiiae 


100 


0 


0 


Reduction of: 








Nitxatc 


0 


100 


100 


pH optimumt 








7 


11 


0 


0 


> 9 


100 


100 


100 


10 


17 


100 


100 



> 10 
Growth ai: 

10 

40 °C 

45 

50 ''C 

frrowth in NrtCl: 

5 % 

7% 

6% 

9% 
10% 
11% 
12% 
13% 
H% 
15% 
10% 
17% 
1ft %^ 



6 lOO 100 

100 100 0 

100 100 lOO 

100 100 ion 

0 0 0 

0 0 0 



100 
100 
100 
100 
lOO 
100 
100 



100 
100 
100 
100 
100 
100 
100 



100 100 

100 100 

100 IOC 

100 100 

78 25 

0 - 



100 
100 
100 
100 
100 
100 
100 
100 
100 
100 
100 
67 
33 



4a* 


4b 


s 


6 


7 


8a* 


Sb 


9 


10 


11 




5 


16 


6 


11 


2 


4 


4 


5 


5 


0 


20 


6 


0 


100 


100 


lOO 


0 


0 


0 


0 


0 


0 


0 


100 


0 


n 


0 


0 


100 


60 


100 


lUO 


0 


100 


100 


lOO 


100 


100 


0 


100 


100 


1 nn 


100 




100 


lOO 


100 


100 


0 


0 




IZ 


0 


u 


0 


0 


0 


20 


0 


0 


U 




0 


lOO 


0 


75 


75 


100 


60 


0 


0 


100 


0 


100 


0 


75 


75 


100 


60 


n 

u 


u 


u 


0 


0 


0 


0 


0 


u 


0 


0 


0 


12 


100 


100 


0 


0 


0 


20 


40 


100 


100 




inn 


p4 


lOO 


100 


0 


0 


100 


100 


100 


100 


0 


100 


0 


0 


100 


100 


0 


07 


0 


95 


0 


0 


0 


0 


100 


80 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


100 


100 


0 


0 


7B 


100 


100 


100 


100 


80 


0 


100 


100 


100 


100 


100 


100 


100 


100 


0 


0 


0 


100 


100 


100 


0 


0 


0 


0 


0 


0 


0 


100 


100 


0 


0 


0 


0 


0 


0 


0 


0 


100 


0 


0 


0 


0 


0 


0 


0 


100 


100 


too 


100 


1 nn 

1 UU 


A 

V 


lUU 


100 


100 


100 


100 


100 


100 


J \J\J 


100 


u 


1 nn 


100 


100 


100 


100 


100 


100 


ion 




u 


inn 


100 


100 


100 


100 


100 


100 


b4 


36 


0 


50 


100 


0 


100 


100 


100 


100 


fi4 


0 


0 


0 


100 


0 


40 


100 


100 


100 


0 


0 


0 


0 


0 


0 


20 


100 


100 


100 


0 


0 


0 


0 


0 


0 


20 


100 


100 


35 


0 


0 


0 


0 


0 


0 


0 


100 


100 


0 


0 


0 


0 


0 


0 


0 


0 


100 


100 


0 


0 


u 


0 


0 


0 


0 


(1 


0 


20 


0 


0 


0 


0. 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 1 



AIJ «rAins we po„r;vt for ,hc following tcsis. a.,cm gclaur hydroly.^is. growth ai 15, 20, 30 and 
37 r pH optimum for grc>vth > 8 (.« bclowj. Th=y Nvcrc -Al ncRacivc: m the foUowiuf rc;>cTions 
hyri,0ly„s Of Twccn 80, and gro^.h .t .SO 'C in 20 % N>CI (sirainf PN-54 PN-12^Li.v=tr 
tne Ust mcDiiunedj. 

fZth'rL^xl'tlT'^' ^%r'^r''^ ^"^"^^^ hybrldiza^on resales, AIJ urhcr cluster, 

(with rbe rxccptiOn of c)a.i.r 7) «re based on .Tr^uns with > 65 % homology tn rhc reference Strai.. 

t pH 'nprimam' rorgrowih coded as po.itiv. ' > 6' iBdicaie. .hat t^rowth pH 8 wa. .srimAtcd to be 

roj- pll 7, a po«nvc rcco;'d indicaics i;rnw:h ar ncyrrality, r k f 

^ A dash ipdicaccs no data available for thar entry. 



&u]ls from northern Europe, formed a cLscrcie cluster 4t 
82% .r^. Eighteen of these strains showed > 6S % DN\ 
hybrulizanon with strain PN-3 whereas four srrains 



hybridised to a lesser extent with the reference- strain (bcc 
Table 4). The chHractcrisrics of thc5c })acTcf ia are shown in 
Tables 2 and 3, They were purtJcuUrly notable for their 
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r^fll^f; °^ wrbohydrate substrates by strains of dusters shown in the 

5/UPGMA analysis modified according to the results of DNA/DNA hybridization 



Subfjtfiitc 


Cluster . 


1 


2 


4a 


4b 


5 


6 


8fl 


8b* 


9 


lOf 


11 




No. of Sttains . 


. IB 


4 


3 


5 


16 


6 


2 




4 


3 


5 












Percentage positive 








2 


Erythrirol 


0 


0 


0 


0 


0 


50 




Q 


0 


n 

'/ 


w 


3 


D-Arabinoiic 


22 


100 


0 


20 


44 


50 


0 


0 


25 


6? 


20 


4 


L-ArabinoBc 


5 


100 


0 


40 


94 


100 


0 


Q 


100 


100 


100 


5 
<i 


R)bo5c 


89 


100 


0 


40 


lOO 


100 


Q 


J J 


100 


100 


100 


D-XylOiit: 


55 


100 


0 


20 


100 


100 


50 




100 


33 


40 


9 


Mcihyl ^-p-xylofiidc 


0 


0 


0 


0 


31 


33 


0 


0 


75 


100 


40 


10 


Galactose 


5 


100 


33 


60 


100 


83 


0 


0 


100 


100 


40 


13 


Maonose 


55 


100 


33 


60 


100 


inn 




JO 


0 


33 


100 


15 


Rhamnnsc 


0 


60 


0 


0 


100 


100 


n 

U 




75 


100 


40 


16 


Duicitol 


0 


0 


0 


20 


0 


100 


Q 


n 


0 


Q 


y 


17 


Inositol 


5 


0 


0 


60 


100 


17 


Q 


f) 


0 


33 


Q 


19 


Sorbitol 


0 


50 


0 


0 


Q 


100 


A 
U 


n 


0 


0 


y 


20 


Methyl a-D-mannnside 


0 


25 


33 


0 


Q 




A 

u 


r\ 
IJ 


0 


33 




21 


Methyl a-D-glucosidc 


17 


100 


33 


20 


94 


inn 


n 
U 


y / 


lOO 




A 


12 


N-Accrylglucosamme 


100 


100 


33 


80 


100 


100 


100 


1 00 

1 WW 




100 




23 


Amygdalio 


22 


100 


33 


60 


100 


100 


100 


33 


100 


lOO 


60 


26 


Salicin 


67 


100 


67 


60 


100 


100 


50 


0 


100 


inn 

1 yv' 


inn 

LWW 


2y 


Lacrosc 


0 


50 


0 


0 


100 


63 


0 


0 


100 


100 


1 nn 

LWW 


30 


Mdibioit: 


0 


100 


33 


60 


100 


100 


0 


0 


0 


inn 


100 


34 


Mclczitoiie 


0 


25 


33 


60 


100 


100 


0 


0 


0 


33 


100 


35 




0 


100 


33 


60 


94 


100 


0 


0 


0 


100 


100 


36 


Search 


95 


100 


0 


20 


100 


100 


100 


100 


100 


100 


0 


37 


Glycogen 


67 


100 


0 


0 


100 


100 


100 


100 


100 


100 


0 


38 


Xyiicnl 


5 


0 


0 


20 


81 


100 


0 


0 


0 


0 


0 


39 


Genriobiofcc 


11 


in 


0 


20 


56 


83 


0 


67 


100 


100 


100 


40 


u-Turanosc 


39 


100 


0 


20 


94 


100 


100 


67 


100 


100 


100 


41 


D-Lyitos€ 


22 


100 


0 


20 


87 


100 


0 


0 


75 


67 


0 


42 


D-TagfttOsc 


0 


75 


0 


0 


6 


100 


0 


0 


75 


67 


0 


45 


i)-Arabiio] 


16 


75 


0 


40 


75 


83 


0 


0 


0 


33 


0 


47 


Gluconate 


61 


0 


0 


20 


100 


17 


50 


67 


25 


0 


0 


46 


2-Kc(oglucoaait; 


5 


100 


33 


2U 


100 


83 


50 


67 


50 


0 


20 



All strains assigned to clu.tcf. were pnjiuvc lor the folJowini; mbstratcs: cdlobio.c, fvuciOSc. glucose 
glycerol, maltose, sucrose aaJ trehalose; and ncfjauvc for adonirol, L-ambicoi, D-fucose L-fucosc inul.n' 
2-kctogiucoiiatc, L-3cyloie. ' ' 

*Srrain PN-120 vmable ro grow in the modified API system. 

t Strain PN-57 unable to gro* m the modified API .lystem. 



high NaQ tolerance and ability ro deanunatc phenyl- 
alanine. The morphology of the reference strain showed 
rypica] oval cndosporcs in fl non-SNvollcn sporanaium 
(Tig- 2a). ^ 

Phenon 2. These six bacteria, like rhuse of phcnon 3, were 
obtained from soil or mud samples and had a similar 
requirement for sodium ions and pH. The mean DNA 
base compoisition of these bactcna was lower tlian that for 
strains of phenon 3 (39-3 mol% G + C compared with 
42-6 molVn; sec Tabic 4). Four of these strains formed a 
dijitinct ;;roup based on phenoi:yi:»e and DNA 
hybndJzution (>73Vo DNA reassociat lon), but DNA 
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could nor be ipolutcd from uhe other two strains, PN-127 
jmd PN-129. The reference strain produced oval spores 
which clearly distend the sporangium (Pig. 2b). Tliesc 
bacteria can be distinguished by their ability to hydro lysc 
cellulose and xylan effectively (H, Outirup, pei-sona] 
communication). 

Phenon 3. The three b trains in thi^ laxon, all isolated from 
soil ur mud samples, formed v. tight cluster at 92% S^, 
DNA from the three strains showed a high degree of 
reassociation and close similority in moi % Gi-C com- 
position (Tabic 4). They were moderately halophilic, 
showing good growth at 15% NaCl. Ividecd, physio- 
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dttterf " ^""'P'*^'^'^^ hybridization of DMA to reference DNA from strains representing the phenotypic 

AJI Tiguies ID pifcnthcFicj are siandflrd dtrviauons. 



au,«i Suaint Mol % G + C Hybtidizatiun ro rcfr^en« DNA (%) 



1 


18 srrHins 
*PN-3 

:PN-2l 
PN-73 

' PN-59 


39-6 (0'4) 
39-6 

39- 6 
40 2 
4^7 

40- 4 
391 


2 


4 s:amft 
-PN-105 


3y'3 (0-2) 
39-2 




3 suains 
*PN-1U2 


42-6 (0'2) 
42-4 


4 


a; 3 ufiinti 

"PN-1 

b ; 5 Etrain^ 

^PN-2r) 

PN>25 


(02) 

36-4 

39-5 (0 2) 

39-2 

38'2 


5 


PN-17 
PN-ri3 
PN-22 
PN-44 
PN-38 


42-6 (0-4) 

42-fl 

42-6 

42'7 

42-6 

42-8 

42-6 


C 


0 fitrnias 
*PN-23 
PN-56 

PN-4.S 
PN.75 


43-7 (0-8) 
42'B 
43 '.S 
4.V2 
45'3 


8 


a : 2 simin^ 

-PN-118 

li: 4 scrainfl 

"PN-121 

PN-117 

PN-123 


3ftCi 
38-6 

41 '5 (0-4) 
41-3 
^Q6 
40'G 


? 


4 strains 
'PN-137 

PN-n6 


3S'S (0-3) 

38-2 

398 


10 


4 stTHins 
PN-57$ 
*DSM 465 
PN-lOO 


58'2 (0-2J 

362 

37-0 

37-a 


11 

Single 


^ itiiims 
*PN-lt)9 
PN-]13 
PN-50 


41-3 (0-3) 
41 2 
41-2 
34-0 



* 2 3 4a 4b 5 6 8a Bb 9 10 11 I 

PN-3 PN.105 PN.102 PN-1 PN.26 PN.30 PN.23 PN-llfl PN-121 PN-1 37 DSM 485 PN-J 09 



15 



&2 (11) 



95 (16) 
100 
60 
36 
31 
27 
23 

16 

9 

7 

6 



60 
fi3 
57 
51 

37 

78 (in) 

5 4 12 19 25 12 lOa 12 3 7 IG 

63 
61 
13 

20 5 34 15 20 9 39 

S 4 T3 12 U y 12 



10 



100 


93 (9) 


13 


13 


4 


10 


9 


7 


6 


4 


13 


3 


100 


13 

85(11) 


14 

15(1) 


3 


u 


10 


A 


7 


12 


IB 


4 


7 


IDQ 
17(5) 


13 

fl7 (9) 


7 


10 


9 


6 


5 


7 


12 


4 
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t Strains withm clusurs arc given m Tabic 1 , Strains ^izh more than 65 % sequence homology to the reference siram have been grouped and 
the mean hybrjdizanon value is given. A.tensks show the reactions of the reference srrain of the cluster and ihl, striia iS mcludcd in the group 

t Although suain PN-57 va? recovered as -a .mgle^membcrcd cluster m Fig. 1 u vvas included here as a member of clusicr 10. 

§ -rhcse data ior smglc-membered cluster, rcprtssent a sulgle stfaiil bur arc lypical o( chr data for all the Siiliclc-n.cmbered cUiSters .hown Hi 
lahlc 1 , 



logical sruciics (data noi shown) revcided thai these 
bacteria and those of phcnon 2 v^crc unique among all the 
taxa studied in being unable to grou' in the absence of 



sadiunn ions. They uxrc also unable to grow at pH 7 ^nd 
showed an optimum for growth at pH 10. A disdnctive 
fe.-iture of these biictcria was their inability to hydrolysc 
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Starch. The reference strain produced ova] endospores in 
a clearly swoHcn spmangium (l-ig. 2c). 
Phenon 4. This heterogeneous cluster comprised sttains 
allocated to (^vo DNA hybridizanon groups zs well as a 
single strain which was excluded from these groups 
(Table 4). The three group 4fl strains were largely from 
soil while the five group 4b strains originated from a;iimaJ 
manures. Reference strains from both taxa produced oval 
endospores which did nor swell the sporangia (Fig. 2d, c). 
The physiological teats and the carbon utilization tcsLs 
distinguished these taxa but ihere were no deftniove 
features. Group 4a suams were less tcmper;iture tolerant 
than those of 4b. Uaijv.ation of amygdalin, gentiobiosc 
and mannose also provided some distinction between 
these groups. DNA analysis showed that ^^roup 4^ scrams 
were slightly lower in G-f C content (38-4-38-8 mol%) 
than group 4b strains (39-2-39-7 mol %). 
Phenon 5. The strains m this phenon formed a homo- 
geneous duster ai 90 Vo Sj which was supported by the 
DNA hybridization studies, with only five of the 21 
stiain.s showing less than 65% rcassociation to the 
reference strain. 'Ecteillus alcaiophilus subsp, halodurmu" 
DSM 497 was included in this cluster and therefore its 
name is used for this group. Tlie slightly swollen 
sporangia containing oval endospores are shown in Fii^. 
2(f). The orjL^^anisms largely originated from animal 
manure and chicken yard soils and were distinctive in 
their ability to grow at 5.S °C. In spite of the subspecific 
name baiodurans, these bacteria had only moderate tol- 
erance Lo NaCl (around !2%) in comparison to other 
groups. However, the name is to he con-^rrued in the 
context of B. dcalophilus^ which has much io\ver tolerance 
LO NaCl (Boycr ti aL, 1973). 

Phenon 6. Of the nine strains m phenon 6, rhree showed 
less than 65% DNA hybridization with the reference 
strain and only one less than 60%. The long rods typical 
of the reference strain ate shown in Fig. 2(g), These 
strains had the highest G + C contcni of all the organisms 
included in the study (437 mol%). They reduced nitrate, 
were unable to hydrolysc Tween or puHulan, and were 
cjiirly sensitive to NaCl. 

Phenon 7. This phenon was identified as JB. tohnii and 
included several strains from the Wadi Natrun in E^^ypr 
and from Mono Lake (CaJifornja). These strains were 
excluded from the DNA study because they have been 
charactcri;ced previously (Spanka & Fritzc, 1993). Our 
icsuits confirm the conclusions of the earlier study and 
emphasize the relatively weak NkCI tolerance of these 
bacteria. 

Phenon 8. Tlie low level at which phenon 8 was defied \n 
the 5j/UPGMA analysis was highli^rhted by rhe DNA 



hybridization study, which revealed two groups of strains 
and two further strajns which were unrelated to these 
groups but could be related to each other (they share the 
same base composition; see Table 4). Croup ba (two 
strains) could be distinguished from group 8b (four 
strains) by consistent differences in l?ase composition of 
DNA and the sensitivity of the former srrajns to 5% 
NaCl. Moreover, strains of group 8a, in contrast to most 
strains of group 8b, failed to hydrolyse Tween 40 a^id 60 
and to grow on glycerol. There ^erc differences in 
carhohydrare uulization patterns between strains of 
groups 8a and 8b, particularly with regard to flmygdalin 
and N-acetylglueoSamine utilization. Morphologicilly, 
the reference Strains of the two groups were similar but 
not identical (Fig. 2h, i). 

Phenon 9. One of the five phenon 9 strains was unrelated 
to The reference strain by DNA hybridization and had a 
shghdy higher DNA base composition than other 
members of the clusicr (Tabic 4). These bacteria failed to 
grow at pH 7 or above pH 10 and at temperatures above 
40 'C. They were tolerant to moderate concentraiions of 
NaCl (10%). The reference strain produced oval endo- 
spores in a sbghtly swollen sporangium (Fig. 2)). 

Phenon 10. This phenon comprised six strains, of which 
four showed > 74 % DNA rcassociarion lo B. alcaiophilus 
DSM 485^, one showed only 57% und the sixth did not 
yield high-quality DNA. The strains in this group showed 
very similar physiological properties to those of phenon 9 
but were less salt rolcrant (growth up ro 8% NaCI) and 
had a slighdy lower G + C content. Typical morphology 
of these cells is shown in Fig, 2(k). 

Phenon 11. The six strains in phcnou 11 were notable for 
their homogeneous base composition (41-1 mol % G+ C), 
with only one strain showing less than 65% DNA 
hybridization lo the reference stram. Phenotypically, they 
were characterized by lack of starch and pullulan hy- 
drolysis, relatively inw tolerance ro NaCl and inability to 
grow at 40 "C or above. The distinctive short fat cells of 
the rcfeience strain containing oval endospores arc shown 
m F,g. 2(1). 

Single-membered dusters. Nine strains were recovered ixi 
single-mcmbered clusters and two in a doublet in the 
J"j/UPGMA analysis. To determine if these bacreria were 
foci of new clusters or aberrant membcr.s of extant 
clusters, DNA was prepared and hybridized to the DNA 
from the reference srrains of the 11 clusters. Except for 
stram PN-57, which was allocated to chc DNA 
hybridization group of phenon 10, there was no significant 
hybridization of the DNA from the singlc-membercd 
clusters to any of rhe reference DNAs. This confirmed 
that die majority of these bacteria did not belong to any of 



^^d^'cf^^^'^'^Vv'^ non-5porulating cells of tepresentative strains of DNA hybritJizatign groups estabffshed in this 
JJL? Ml, alkaline nutrient agar supplemented wrth 5 g Nad f- and 10 mg Mn^' ml' M2 a kaMne 

St^a 'n PnI (croTo' lTlTt T^l? .k?'.' water LbstKuted v^ith so' e«raa Bar, 10 a) 

3 Ml 2d 30°r riU^ ■ tl^Ti^' ^H-'r. ^'^■^^^ 2): Ml. 1 30 X ar^d 4 d. 18 ^C. (c) Strain PN:i02^^roup 

Z Lin RM Ml Vn ^ . ^ ^^-"^'^ ^^"''^8 (9roup ga): Ml. 2 d. 30 X (!) Strain 
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the clusters shown in Fig. 1 (data not shown). Moreover, 
the possibility that these bacteria may be related to each 
Other but failed lo cluster m the phcnogram was excluded 
by preparing partial 16S rRNA scc^ueoccs and analysing 
them in the context of die full sequences for the reference 
strains (Niebcn e/ aL, 1994). Ail of the partial sequence? 
were rccovcced mdcpen<icntJy in rhe phylogcnctic tree 
(data nnr shown), confirming chat they were indeed foci of 
new species for which we had only single representatives. 

DISCUSSION 

When faced with a heterogeneous collection of bacteria, 
the construction of a bask taxonomic framework from 
which detailed studies of species can be initiated is 
somctijiaes a daunting task. Numerical phcnedcs has 
become established as a rapid and reliable procedure for 
generating this initial daiabasc in most instances. Fot 
example, numencai analysis of Strcptomyca established the 
framework for future taxonomic st-udies of species within 
this genus (Williams ti al,, 1983) and a recent numcrlc^)l 
analysis of the themiophihc 'hacillm strains (White et aL, 
1993) established species which were subsequently 
evaluated by DNA feassociation and mdepeodcndy by 
comparauve rRNA sequence analysis (Rainey tl ai., 1 994). 
This current assessment of ihe diversity of the alkaJiphilic 
haciilus sr rains confirms numerical phcnctics vts an ideal 
approach fox gauging the species diversity within a group 
of bftctcria and indicating reference strains for molecular 
analyses smce, with only a few exceptions, the phena 
defined by the Jj/UPGMA analysis were fully substan- 
tiated by DNA fcassociation and rRNA sequencing 
(Nielsen f/fl/. J 9y4). 

When usin^,' numerical phcnctics in this wny, it is a benefit 
if a minimum of tests can be usetJ and, wherever possible, 
these can be based on test kits or automated systems. We 
have coTifiimed previous numerical analyses (e.g. Pnest si 
a!., 1981) which showed thai accurate ciusteting can be 
detected with fewer than 50 characters, the only problem 
being that with this reduced database some taxa are not 
separated. Thus in the above-mentioned study, B. 
amyloliqutjacnns could not be distinguished from B. snbiilh 
and individual B. sphatrkus DN A hybridizarion groups 
were indistinguishable. Similarly, in this study, clusters 4 
and 8 were shown to be heterogeneous by DNA 
hybridi^adon. Such lack of resolution is to bt cxpecTtd 
from such a small number of tests (Sneath & Sokal, 1973) 
and it is likely that with nnore characters these taxa would 
have been separated. 

API test systems have been used successfully for the 
cUssification and identificaaon of bacillus sirains (Logan 
6c Berkeley, 1984). However, some alkaliphilic 
su^tms, notably those from clusters 2 and 7, were uniiblc 
to produce a detectable reacuon m the API tests even 
though the system had been modified for u^e at high pH. 
Given the problems of conduciing physiologicjil tests at 
extremes of pJl, this was not surprising aad illustrates the 
difficuidcs involved For these reasuns, in some previous 
taxonomic studies of these bacieria, the Organisms were 
first 'i^dapLed' to growth at pH 7 and all tests were 
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Fi'g. 3, Comparison of phenotypk similarity calcuiated from the 
5j coefficient with DNA hybridization for 230 pairs of strains. 



performed ai neutrality (Gordon & Hyde, 1982). How- 
ever, this is an artificiaJ answer to the problem and may 
lead 10 erroneous physioiogicaJ reactions. 

The numerical analysis was evaluated by detcfmiaation of 
the DNA base composidon of all the strains in the study 
and extensive DNA hybridization reacdons. In addition 
to revealing the heterogeneity of clusters 4 and 8, ihe 
DNA rcassociadon reactions also showed that most 
clusters contained strains which were genomieaily 
unrelated, but in all cases these were a minority of strains. 
We defined DNA hybndizauon groups at 65%, Nvliich is 
5% below the currendy accepted guideline used in (he 
species definidon (Wayne stai., 1987), but given the error 
rales of around 10% (standard deviadon) for our 
detetminations using an immobilized assay, we consider 
this acceptable. Indeed, this limit was shown to be 
appropriate by plotdng percentage DNA/ DNA 
reassocianon against phenetic similarity dciennined by 
for 230 pairs of strains (Fig. 3). Although similar results 
have been presented previously (Stalcy & Colwell, 1973) 
they have not been based on so many determinadons and 
do not show so clearly the cut-off of 'species pnirs' at a 
littic above 60% DNA hydridizarion and greater, with 
most non-homologous pairs showing less rhan 40% 
binding. This provides good evidence for the discon- 
tinuous spread of DNA hybndizadon values among Lhest 
bactenal species and for the use of DNA hybridization for 
spcciation. However, exact defilniuon of die level of DNA 
hybridization for the .species boundary and ihe difficulties 
presented by subspecies remain problemauc. 

Ii IS clear from Fig, 3 that pairs of bactenal strains which 
show > 65 % DNA hybridization invariably show high 
phcnorypic reiatedncss as judged by Jj and diat most 
strains which show low DNA reUtcdness present 25-GO % 
phenetic similarity. However, some strains which were 
phenorypicaJly similar were genomieaily unrelated. This 
is probably a result of ihc limited phenotypic database and 
reflects the problems associated with heterogeneous 
clusters. It would be interesting to repeat this exercise 
with a full numerical taxonomic diitabase, in which case n 
is likely ihar these aberrant strains would be correctly 
placed. 

It is also useful ic compare the DNA h)^bridizarion data 
with the full 16S rRNA sequence similarities which arc 
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Ffg. 4. Companjon of sequencQ similarity based on the full 16S 
rRNA sequence (Nielsen et a/., 1994) with DNA hybridUation for 
represenutfve alkafiphilic fldc/7/uJ strains. 



available from our previous study (Nielsen al., iy94). 
The data in Fig. 4 include only genomically uilrclated 
strains since strains with high DNA hybridizatjnn should 
have viriually identical rRNA sequences. The scatter of 
points IS simHar to that obtained by Amann ei aL (1992) 
for Fihrohacter strains and for Daulfovihno by Dcvcreux et 
nl. (1990), and shows that ar bclo'w approximately 99% 
rRNA similarity DNA hybridization is below 50% and 
that Strains with rRNA similarities at this level definitely 
represent distinct genospccies. In contrast to the previous 
studies, iher<r was no indication rhat pairs of strains 
showing rRNA similarity below 9^ % show lower DNA ' 
hybridization than those with rKN/V similarity between 
94 and 99 %. Tt has been suggested that these lower values 
could be cqa^itcd with the generic boundary (Amann et 
at., 1992). 

The phylogcnetic positions of aiJ 13 alkalipbihc geno- 
specie?; which have been defined in this study have been 
reported by Nielsen ci aL (1994). Strains which had been 
included m ptevioup phcnctic taxonomic studies were, 
mainly recovered in phcna 1, 4a, 4b, 5, 6, 7 and 10, with 
one new isolate each being assigned to phcna 1 And 6 and 
two m phcnon 10. AU strains of phcna 2, 3, 8a, 8b and 11 
u-erc provided by 14. Outtnip (Novo Nordisk, Denmark), 
as were the four new isolates mentioned above, and only 
limited infornifltion from prcviotis studies is available for 
these strains. Phenon 7 strains were identified as cohnii 
and related taxa (Spanka & Fritzc, 1993) and are not 
further discussed; other taxa arc discussed sSeparately 
below. 

Phenun 1 (DNA hybridization group 1) contained strains 
which had previously been identified as B.firmus (Gordon 
& Hyde, 1982), although strain FN- 77 was an exception, 
since it had been classiiicd originally as a B.fi'rmuj-B. kntnj 
intermediate (Gordon & Hyde, 1982). All of these strains, 
including PN'77, were classified as phenotypic group 1 by 
Fritzc tt aL (1990). These bacteria arc phylogeneticalJy 
unrelated to J3. prmns (Nielsen et al., ] 994) and dieir mean 
DNA base composition \s slightly lower than that for B, 
firmus (39-6 mol % against about 41 mol % , Fahmy it aL, 
1985; Frit/ef/tf/., 1990). Moreover, unlike B^firmu:, mbst 
of these bacteria are unable lo grow at pH 7, are tolerant 
of very high levels of NaCl, and dcaminate phenylalanine, ' 



a variable feature for B. prmt4s sensu strtc/o (Gordon st al 
1973, Priest ct aL, 198K). They therefore represent a new 
species for which we recommend the name 'B. 
p:tf4dofirmus\ 

Strains in clusters 2 and 3 were phylogcnciically removed 
from all other bacilli, The closcsr relatives were J3. 
akaiophilus DSM 485^ and strains representing clusters 1, 
4a, 4b, 5, 6. 9 and 11 of this study (Nielsen et aL^ 1994). 
DNA hybridization confirmed that these clusters com- 
prise separate taic^ at the species level and they were 
unique in this study in requiring sodium ions for growth. 
We suggest the names 'B. agar adhmr ens' and 'B. clarku 
for clusters 2 and 3 respectively. 

Phcnon 4 was divided into two hybridization groupb but 
both contained strains which had previously been 
classified as B, lentuj rype I (Gordon & Hyde 1982) or 
phenotypic group 4 (Fritze et aL, 1990). These bacteria 
were unrelated to B. /^w/'wj phylogcnerically (Nielsen et aL, 
1994) and had a higher DNA base composirion than B. 
lentus strains (38^39 mol % G + C versus 36 mol % for the 
type strain of B. Imus; Fahmy ei aL, 1985). Unlike B. 
lentus, these bacteria were protcoiyric and grew in 10% 
NaCI; they cannot thcrcfote be included in this species. 
Phcnon 4 comprised two DNA hybridization groups and 
the reference strains were unrelated by 16S rRNA 
sequence analysis (Nielsen ct ai,, 1994). Although these 
molecular features clearly support two mdependent and 
separate taxa, tlie phcnctic data are insufficient to provide 
diagnostic features at presem. The only tndica[ions of 
differing phenotypic properties are the inabihty of group 
4a strains to grow at 40 °C while 4b strains grow at 42 °C, 
and the slightly Urgcr sj;5e of strain PN-1 (4a) cells. To 
comply with ihe recommendanons of the Ad Hoc 
Committee on Keconciliation of Approaches to Bacieriai 
System atus (Wayne ct aL, 1987), wc refrain from naming 
these t-axa until more strains have been isolated and 
funher diagnosric features are available. Vac proposed 
reference strain for t^xon 4a is PN-1 (^ DSM 8714 — 
NCIMB 10291); that for 4b is PN-26 (= DSM 8717 ^ 
NCIMB 10288). 

DNA hybridizauon ^^roup 5 comprised 16 striuns, in- 
cluding 'B. akaiophilus subsp. halviurans' (Boyer tt al.^ 
15^73). These strains were previously assigned ro S. lemur 
type 111 (Gordon ^ Hyde, 1982) and phenotypic group 4 
(Fritzc H aL^ 1990). Again, rRNA sequence unrelatcdness 
(Nielsen etaL^ 1994) and gross difference in moi % G + C 
(42-6 % for phcnon 5 strains) exclude these bactcna from 
B, kntus as well as B, akaiophilus. Wc propose to revive the 
name halodurans, in the new combination 'B. halodurans' 
for these bacteria. 

All Strains of DNA hybridization group 6 were previously 
assigned to B. lentus type 11 by Gordon & Hyde (1982) and 
to phcnot>7)ic group 3 by Fntze t-f al. (1990). Thc^c 
bacteria are distmcr from B. kntus on the basis of their 
rRNA sequences (Nielsen ct aL, 1994), m DNA base 
composinon (43-7 mol% G + C), and in several pheno- 
t>pic attributes such as casein hydrolysis, nitrate rcducrion 
and ability- to grow nt 50 (ail negadvc for B. kntus and 
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posiuve for DNA hybndizauon group 6). diercfore 
propose thar these baccem should be assigned to a new 
species, for which the name '8. fW/' js recommended. 
Two indcpcndem DNA hybridization groups were 
defined wiuhm phenon 8 sixains. Agiin, the disrmct 
posiuons of the reference strams according to rRNA 
sequence analysis underline the validity of these taxa. 
AJrhough diagnostic phenotypic features arc scarce 
growth in 5% NaCI and cell morphology (group 8a 
strains arc considerably larger than 8b strains) consistcmiy 
separate these groups and therefore we propose both as 
new spccjep: halmapalus' for taxon 8a strains and 'B. 
horikosbiV for taxon 8b Strains. 

Clusiers 9 and 10 were two further taxa v/hach were 
diflicuh to separate phenotypically. Virtually all strains of 
both phcna were recovered in their corresponding DNA 
hyhndizatjon groups and the reference strainb had distinct 
posidons according to rRNA sequence analysis. How- 
ever, rhe higher NaCl tolerance of group 9 sirains 
compared with those of group 10 and rhe distinctly 
swollen sporangia of the former strains provide two easily 
determined and definitive features for identification. 
Group 10 strams represent the species i?, alcalophilus and 
the data presented here are used to provide an emended 
description of this previously monorypic species. We 
propose the name ' h, pseudaUalophilus' for group 9 strains 
in recognition of their close phcnotypic and phylo^renetic 
relauonship with J3. alcalophilus. 

Phenon 11 strains were well defined phcnotypically, fay 
DNA hybridization and by RNA sequence analysis 
(Nielsen tt ai, 1994). Lack of starch and pullulan 
hydrolysjs and low pH optimum and NaCl tolerance for 
growth are characteristic of these baacria. The name 
gih.tvnii' is proposed for these organisms. 

NOMENCLATURE 

Emended description of Bacillus alcalophifus (Vedder, 
1 934) 

Tins descnpi^on is based on the strains of DNA 
hybridization group 10 of this study. Colonies are white, 
circular, smooth and shinin^^, sometimes with a darker 
centre, Cells are rod-shaped (0-5-0'7 x 3 0-5-0 um) pro- 
ducing ellipsoidal cndospores (0-5-0-7 x 0-9-1 ^tn) 
which arc located sub terminally and do nor swell the 
sporangium. Strains of this species hydrolyse casern 
gelatin, pullulan, starch, Twcen 40 and Twxcq CO, They 
do not hydrolyse hippuratc, MUG or Twecn 20 (strain 
PN-138 IS weakly positive). Njtrate is not reduced to 
nitrite (strain PN-lOl is an exception) and strains do not 
deuminatc phenylalanine. No growth is observed at 
pH 7-0; tlie optimum is about pl-l U) (pH 9 0 for strain 
PN-57). The tcnnpcrature range for growtl^ is 10-40 °C 
There IS a variable reaction to NaCl: the type strain and 
strain PN 57 arc sensitive to 5% NaCl but other strams 
Lolcraic 8% NaCl. Carbon source utilization profiles 
showed that strains use L-atabinose. mclibiose and rham- 
nosc for growth. The chromosomal DNA base com- 
position is 36-2-.38'4 mol % G -hC. 



Source: soil and faeces. 
Type .nrain: DSM 4R5. 

Description of Baciflus agaradhaer^tts sp. nov. 

Banlluj agaradhaerens (a.gar.ad'hae.rcns) sp. nov, Malayan 
n. ff^ar gelling polysaccharide from brown ai^ae; L adj 
adhamns adherent; ML adj. agaradhatnm adhcrin^^ to the 
agar which is characteiisdc of these colomes. 

This description is taken from this paper for DNA 
hybridization group 2 and unpublished results of H. 
Outtrup. Colonics are white and rhizoid with a fila- 
mentous margin. Cells arc rod-shaped (0-5-0-6 x 2- -5 ^m) 
and produce ellipsoidal spores (0*6- 1-0 x 1-0-1 -6 ^m) 
subtermioally positioned witl^in a sporangium, which i$ 
clearly swollen. Strains of this species arc characccrii^ed by 
hydrolysis of Tween 40 and 60 (PN-lOS is negative for the 
former) casern, gelatin, starch, cellulose and xyian. 
Hippiirare,_ MUG and Tween 20 are not hydrolysed and 
phenylalanine is not deaminated. Nitrate is reduced to 
nitrite. Strains of thi.-^ specie? ate strictly alkaliphiUc: no 
growth is observed at pH 7-0 and optimal growth is 
observed at pH 10 0 or above. Growth occurs within a 
temperature range of KM5 ""C. Tolerance to 1 6 % NaCl is 
observed (strain PN-107 grows only weakly, at this NaCl 
concentration). Carbohydrate uttlizanon profiles show 
growth on L-arabinose, galactose, mannose, N- 
flcctylgiucosaminc or 2-ketogluconate and most strains 
grow on tagatose but no growth on methyl /T-D-icyloside, 
inositol or xylitol is observed. The chromosomal DNA 
composition is between 39-3 and 39-5 mol % G + C. ' 
Source: soil. 

Type strain of this species is PN-105 (- DSM 8721). 

Description of Badllus darkii sp. nov. 

Bacillus clarktt (dai'ki.i) sp. nov. ML gen. n. darkii ftom 
Clark, named after the American bacteriologist Francis E. 
Clark, who made pioneering studies on the taxonomy of 
endospore-forming bacteria. 

This description is taken from the current study based on 
strains of DNA hybridization group 3. Colonies are 
circular with an entire margin and a smooth surface and a 
crcam-white to pale yellow colour (strain PN-104 eventu- 
ally changes to a dark yellow colour, probably due to 
synthesis of an excreted pigment). Cells are rod-shaped 
(O-Cr-0'7 X 2-0-5-0 ^m) and produce ellipsoidal spores 
(0'r>-l'0 X 07^1 -2 fim) located subtcrminally- In the type 
strain, the sporangium is distinctly swollen by the spore. 
Strains of this species hydrolyse casein, hippurate, gelatin, 
Tween 40 (strain PN-104 weakly positive) and Tween 6o! 
Starch, MUG, Tween 20 and pullulan are not hydrolysed. 
Growth is strtaly aJkaliphilic : no growth is observed at 
pH 7-0 and optimal growth is above pH 10-0. Growth is 
observed between 15 and 45 °C and strains are tolerant to 
16% NaCl. No growth is observed in minimal medium 
for carbohydrate utilization at pH IQ-O. The chromosomal 
DNA composition is between 42-4 and 43-0 mol % G + C- 
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Source : boil. 

Type strain of this species is PN-102 (= DSM 8720). 

Description of Bacillus clausii sp. nov. 

Bac/lHs cla^m (clau^si.i) sp. nov. ML gen. n. cLusii of 
Ckus. named after Dieter Claas, the Certnan bactcri- 
ologibt who made fundamental contributions to the 
taxonomy of Bacillus. 

The description given below is ukcn from chib srudy for 
DNA hybridization group 6 strains. Colonies are whue 
and filamentous \rith a filanienrous margin. Cells are rod- 
iihaped (0-5^)'7 x 2-fM O urn), producing spores Nvhich 
are ellipsoidal (0-4--0-6 x 0-7-1 -0 ^im) and located sub- 
lerminally to paracentraJly in the sporangium, which may 
be slighily swollen. Cells lend lo occur m long chains 
which may carry spores. The strains jn this species arc 
characTcri/ed by hydrolysis of casein, gelatin and starch 
bui nut pullulan, Twccn 20, 40 or 60 or MUG. Nitrate is 
reduced hut phetiyiiilanine is not dcaminaied. The pH 
optimum is about 8, and good growth is obtained at 
pH 7 0. Growth at temperatures from IS to 50 °C is 
observed, with .salt tolerance up to about 10% NaCl 
(strain PN-76 tolerates only 8% NaCi). arbohydratc 
utilization patterns show growth on L-arabinose, galac- 
tose, xylito], d\ildLoi, sorbitol, methyl ct-D-m annoside, 
mannosc, N-acetylglucosammc, n-tagatose or 2-kctoglu- 
conate. Some strains grow on inositol. The chromosomal 
DNA composition is between 42-8 and 45-5 mol % G +C. 
Source: soil. 

Type strain of tliis specie? is PN-23 DSM 8716 ^ 
NCIMB 10309), 

Description of Bacillus gibsonii sp. nov. 

Ban/ft^, gibsonii (gib.so'ni.i) sp. nov. J, gen. n. from 
Gibson, named after the British bacteriologist Thomas 
Gibson, who made fundamental contributions to the 
taxonomy of Bacilius, 

The description is taken from the current study for strains 
of DNA hybridization group 1 1 . Colonies are yellow and 
circul'^r wiih an entire margin and a smooth, shiny 
surface. Cells arc usually stout rods (0-6-10 x 2-0-3-0 j^m) 
with cliipsoida) spores (O'C^l-O x 0'8-l'6 ^m) su uatcd 
subLermmally and in ageing cells paracentrally to somc- 
umes lateral. Spores do not cause swelling of the 
sporan^rium. Strains of this species hydrolysc casein, 
^clatm and MUG. Some strains, but not the type strain' 
hydrolysc Twccn 40 and 60. No hydrolysis of starch or 
puUuIan 15 observed, phenylalanme is not dcaminatcd and 
reduction of nitrate i.-^ variable among ^.trains. Growth 
occurs at pll 7-0. with an optimum at about pH 8-0. 
Strains grow in the temperature range from 10 to 37 'C, 
with the type strain failmg to grow at 37 All strain? 
tolerate up to 9% NaCl, whde strain PN-111 grows at 
12% NaCl. Carbohydrate utilizauon pattern shows 
growth on lactose, but not on glycogen or .N- 
acetylglucosamme. The chromosomal DNA composition 
IS between 40-6 and 41 *7 mol % G + C. 



Source; soil, 

Type strain of this species is PN^109 (= DSM 8722). 

Description of Bacillus halmapalus sp. nov. 

Baai/m haimapaius (hal.ma'pa.lus) sp. nov. Gt. n. halmt, 
brine; Gr. adj. hapalof, delicate, ML adj. haimapaius, 
sensitive to brine. 

This description is taken from this study based on strains 
of cluster 8a. Colonies arc small, arcular with an entire 
margin, shiny surface and a cream-white colour. Cells are 
rod shaped (0-6-1 -0 x 3'0-4-0 urn) with ellipsoidal spores 
(0-6-O-8X 0-9-1*5 Mm) located subrermmally to para- 
centrally not swelling the young sporangium. The two 
strains in this species hydrolyse casein, gelatin, hippuratc, 
pullulan and starch. They do not hydrolyse Twecn 20, 40 
and 60 or MUG. They grow at pH 7*0, with an optimum 
at about pH R-0. Growth is observed between 10 and 
40 °C. Salt tolerance is very low, with no growth in the 
presence of 5% NaCl. Carbohydrate utilbjition profiles 
show no growth on glycerol, dbosc, D-xylose, L-ar- 
abinose, galactose, rliamnose, sorbitol, lactose, melibtosc, 
melizitose, D-raflhnosc or D-tagatose. The chromosomal 
DNA composition is 38*6 mol % G + C for the two 
currently characterized strains. 

Source; soil. 

Type strain of thJS species is PN-118 (= DSM 8723). 

Description of Bacillus halodurans sp. nov. 

Baci/lus/ja/oduransap. nov comb. nov. ^nom. rev. {'Bacillus 
akalophilus subsp. baio(krans\ Boyer ei aL, 1973) 
ha.lo.du'rans. Gr. n. hals salt; L pres. part! durans 
enduring; ML adj. hahdurans sait-cnduring. 

This description is taken from this study for strains of 
DNA hybridiyation group 5 ajad Fritzc et al. (1990) and 
corresponds largely with the original description. Col- 
omes are white and circular with a slightly filamentous 
margin. Cells arc rod-shaped (0-5-0-6 x 3'O-4'0 ^m) and 
produce spores which arc ellipsoidal (0'5-0'6 X 0-8-1 "2 Mm) ^ 
and located Subterminaliy in the sporangium, which is 
slightly swoDcn. Cells tend to occur jn long chains 
frequently bearing spores. Hydrolysis of Tween 40 and 
60, casern, gelatin, starch and pullulan is obtained. Most 
strains do not hydrolyse Twcen 20 (strains PN-62 and 
PN-eO positive) or hippuratc (stram PN-31 positive), and 
do not reduce mtrate (strains PN-31 and PN-42 positive). 
MUG is not hydrolysed and phenylalanine is not dcami- 
naied, Most strains grow at pH 7 0 (strains PN-31 and 
PN-42 do not) but optimal growth is obtained around 
pH 9 -10. Growth temperature range is 15-55 °C Strains 
show moderate halotolerancc; good growth is obtained 
up to 12 % NaCi. Carbohydrate utiii>-ation pattern shows 
growth on L-arabinosc, galactose, xylitol, inositol, 
mannose, .V-acetylglucosamme or 2^ketogluconwtc, but 
not with dulcjiol sorbitol, methyl a-D-mannoside or 
D-tagatose. The chromosomal DNA composition is 
between 42-1 and 43-9 mol % G + C. 
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Source: Soil. 

Type strain of this species is PN-80 (= DSM 407). 

Description of Bacillus horikoshli sp. nov. 

Bacilius honkoshn (ho.ri.ko'shi.i) sp. nov. ML gen. n. 
honkoihu of Hunkoshi: named after the Japanese nucro- 
bioJogisT Koki Horikosbi, who has made fundamental 
contribuEions to the study of alkaliphilic bacteria. 

This description is taken from this study based on strains 
of DMA hybridj^iation group 8b. Colonics arc small, 
circular with an entire margin, shiny surface and a cream- 
whirc colour. Cells arc rod-shaped (0'6-0-7 x 2-0-4-0 \Lm) 
with ellipsoidal spores (O-S-O ? X 07-1-2 ^rti) located 
subtcrminally m a sporangium which may be slighdy 
swollen. Strains of this species hydrolysc casein 
hippuriitc, gelatin, pullulan and starch. Three of the four 
strains hydrolysc Twccn 40 and 60. Growth is oKservcd at 
pH 7 0, with an optimum at about pH 8*0. Strains grow 
between 10 and 40 '^C. Salt lokrancc is mbiicrare, with a 
maximum nt 8-9 % NaCl Strains do nor hydrolysc MUG 
or Twcen 20, do not dcaminare phenylalanine and do not 
reduce nitrate lo nitrite. No growthis observed on ribose, 
D-xyiosc, L-arabinosc, galaaosc, rhamnose, sorbitol, lac- 
tose, mclibiobc, melizitose, D-raflFtnosc or D-tagatosc.' The 
chromosomal DNA composition is between 411 and 
42-0mol%G + C. 

Source: soil samples. 

Type Strain of this species is PN-121 (= DSM 8719). 

Description of Bacillus pseudalcalophilus sp. nov. 

Badl/»j pseudakalophiks (pseu.dal.ca.Io'phi.lus) sp. nov. 
Gr. adj. pseudes false; specific epithet akalophilus\ ML adj. 
pseudalcalophilus, fal.se alc&lophllus because it is pheno- 
typically closely related to B. akalophilus . hut phylo^ 
genetically distbct. 

Ihc description is taken from this study for strams of 
D>JA hybridization group 9. Colonies are white and 
circular with nn undulate margin. Cells are rod shaped 
(0-5-0 6 X 2-0 4-0 )im) and produce ellipsoidal spores 
(0'5-0-7 X 0-8-1-3 \xm) which swell the sporangium and 
are situated paracentraliy to Subtcrminally. Strams of this 
spectcs hydrulyse casein, gelatin, starch and pullukn, but 
hippurate, MUG and Twecn 20 arc not hydrolysed. 
Phenylalanine is nut deaminated and nitrate is no: 
reduced. No growth is obtained at pH 7'0 and the 
optimum h about pH lO'O. All strains grow from 
ICMO °C:. Strams tolerate up to 10% Nad. Carbohydrate 
utilization profiles show growth on L-arabinosc or 
galactose, with some strains able to grow on cagatose or 
2-ketogluconate. No growth is observed on mosuol, 
xyhtol, dulcitol, sorbitol, mcthyj oc-D-mannoside, N- 
acetylgiucosamine or manoose. The chromosomal DNA 
composiMon h between 38 2 and 39*0 mol% G + C. 

Source: soil samples. 

Type Strain of this species is PN-137 DSM 872.S). 



Description of Bacillus pseudofimus sp, nov. 

Bacillus psttidofirmHs (pseu.do-fir'mus) sp. nov. Gr adj 
pstud4:s false; specific epithet firmu,. ML adj. pssudofirmns 
the hh^firmus - referring to phy.^iological similarities to B 

This description is taken from this study for strain? uf 
DNA hybridization group 1 . Colonies are yellow and 
round with irregular margjns. Cells arc tod^shaped 
(0-6--0-8 X 3'0-6'0 urn), producing spores which are oval 
(0-5-0-7X 0-5-1-2 Jim) and located centrally to sub^ 
terminally without swelling the sporangium. The strains 
in this species are characterized by hydrolysis of Twccn 40 
and 60, gelatin, casein nnd siarch. Hippurate, MUG, 
pullulan (with the exception of PN-3, PN-lO and PN-72) 
and Tween 20 arc not hydrolysed, nitrate is not reduced 
but all strains dcaminatc phenylalanine. The strains are 
strictly alkaliphilic : for most of the strains no growth is 
obtained at pH 7-0 and the optimum is around pll 9-0. 
The strains show growth from 10 to 45 °C. Growth is 
detected in the presence of up to ^6 Vo NaCl, with most 
strains tolerating 1 7 % NaCI. The carbohydrate utilization 
profile shows growth on nbosc or u-xylosc, but no 
growth on L-arabinosc, galactose, rhamnose, sorbitol, 
lactose, melibiosc, mcliziiose, D-raffinosc or D-tagatosc. 
The chromosomal DNA composition is between 39-0 and 
40-8 mol% G-l-C, 

Source: soil and animal manure. 

Type strain of this .species is PN-3 (== DSM 8715 = 
NCIMB 10283). 
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ABSTRACT 

The 4 202 353 bp genome of the alkaliphilic bacterium 
Bacillus halodurans C-125 contains 4066 predicted 
protein coding sequences (CDSs), 2141 (52.7%) of 
which have functional assignments, 1182 (29%) of 
which are conserved CDSs with unknown function 
and 743 (18.3%) of which have no match to any 
protein database. Among the total CDSs, 8.8% match 
sequences of proteins found only in Bacillus subtilis 
and 66.7% are widely conserved in comparison with 
the proteins of various organisms, including 
B.subtilis. The B.halodurans genome contains 112 
transposase genes, Indicating that transposases 
have played an important evolutionary role in horizontal 
gene transfer and also in internal genetic rearrange- 
ment in the genome. Strain C-125 lacks some of the 
necessary genes for competence, such as comS, 
srfA and rapC^ supporting the fact that competence 
has not been demonstrated experimentally in C-125. 
There is no paralog of tupA, encoding teichurono- 
peptide, which contributes to alkaliphlly, in the C-125 
genome and an orthoiog of tupA cannot be found in 
the B.subtilis genome. Out of 11 a factors which 
belong to the extracytoplasmic function family, 10 
are unique to B.halodurans^ suggesting that they may 
have a role In the special mechanism of adaptation to 
an alkaline environment. 

INTRODUCTION 

Generally, alkaliphilic Bacillus strains cannot grow or grow 
poorly under neutral pH conditions, but grow well at pH >9.5. 
Since 1969 we have isolated a great number of alkaliphilic 
Bacillus strains from various environments and have purified 



many alkaline enzymes (1). Over the past two decades our 
studies have focused on the enzymology, physiology and 
molecular genetics of alkaliphilic microorganisms to elucidate 
their mechanisms of adaptation to alkaline environments. 
Industrial applications of these microbes have been investigated 
and some enzymes, such as proteases, amylases, cellulases and 
xylanases, have been commercialized. It is well recognized 
that these commercial enzymes have brought great advantages 
to industry ( 1 ). Thus, it is clear that alkaliphilic Bacillus strains 
are quite important and interesting not only academically but 
also industrially. 

An alkaliphilic bacterium, strain C-125 (JCM9153), isolated 
in 1977, was identified as a member of the genus Bacillus and 
reported as a P-galactosidase (2) and xylanase producer (3). It 
is the most thoroughly characterized strain, physiologically, 
biochemically and genetically, among those in our collection 
of alkaliphilic Bacillus isolates (1). Recently, this strain was 
re-identified as Bacillus halodurans based on the results of 1 6S 
rDNA sequence and DNA-DNA hybridization analyses (4). 

Analysis of the entire genome of Bacillus subtilis, which is 
taxonomically related to alkaliphilic B.halodurans strain C-125 
(Fig. 1), except for the alkaliphilic phenotype, has been 
completed (5). Knowledge of the complete nucleotide 
sequence of the B.subtilis genome will definitely facilitate 
identification of common functions in bacilli and specific 
functions in alkaliphilic Bacillus strains. We have determined 
the complete genomic sequence of alkaliphilic B.halodurans 
C-125 and its genome is compared with that of B.subtilis in this 
study. Alkaliphilic B.halodurans is the second Bacillus species 
whose whole genomic sequence has been completely defined. 

MATERIALS AND METHODS 

Preparation of the whole genome shotgun library 

A 20 (ig aliquot of chromosomal DNA was sonicated for 5-25 s 
with a Bioruptor UCD-200TM (Tosho Denki Co., Japan). The 
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sonicated DNA fragments were blunt-ended using a DNA 
blunting kit (Takara Shuzo, Kyoto, Japan) and fractionated by 
\% agarose gel electrophoresis. DNA fragments 1-2 kb in 
length were excised from the gel and eluted by the freeze- 
squeeze method (6). The DNA recovered was ligated to the 
Sma\ site of pUC18, which had been previously treated with 
BAP, and introduced into competent XL 1 -Blue cells by the 
standard method (7). Transformants with a frequency of 5-6 x 
1()-V|ig DNA were cultivated in LB liquid medium at 37°C and 
1 [il of culture broth was used for template DNA. The insert in 
the plasmid, amplified using a standard PCR method, was used 
for sequencing. 

Sequencing and assembly of the whole genome shotgun 
library 

The genome of alkaliphilic B.halodurans C-125 was basically 
sequenced by the whole genome random sequencing method 
as described (8-10). The DNA fragment inserted into pUC18 
was amplified by PCR using Ml 3-20 and reverse primers. 
PCR fragments, treated with exonuclease I and shrimp alkaline 
phosphatase (Amersham, OH, USA) to eliminate excess 
primers in the PCR reaction mixture, were used for sequencing 
analysis as template DNA. Sequencing was performed with an 
ABl Prism 377 DNA sequencer using a Taq Dye Terminator 
Cycle Sequencing Kit (Perkin Elmer, CT, USA), DNA 
sequences determined by means of the ABI sequencer were 
assembled into contigs using Phrap (http://bozeman.mbt. 
washington.edu/phrap.docs/phrap.html ) with default para- 
meters and without quality scores. At a statistical coverage of 
7.1 -fold, the assembly using Phrap yielded 656 contigs. 
Sequences were obtained from both ends of 2000 randomly 
chosen clones from a \ library (11). These sequences were then 
assembled with consensus sequences derived from the contigs 
of random phase sequences using Phrap. Gaps between contigs 
were closed by shotgun sequencing of \ clones which bridged 
the contigs of random phase sequences. The final gaps were 
closed by direct sequencing of the products amplified by long 
accurate PCR with a LA PCR Kit v. 2 (Takara Shuzo). 

Annotation and clustering 

The predicted protein coding regions were initially defined by 
searching for open reading frames (ORFs) longer than 
100 codons using the Genome Gambler program (10). Coding 
potential analysis of the entire genome was performed with the 
GeneHacker Plus programs using hidden Markov models (12) 
trained with a set of B.halodurans ORFs longer than 300 nt. 
This program evaluates quality of the Shine-Dalgano sequence 
( SD) and codon usage for a series of two amino acids. The SD 
sequence was complementary to one found at the 3'-end of 16S 
rRNA. The SD sequence (UCUUUCCUCCACUAG...) of 
alkaliphilic B.halodurans C-125 (13) is the same as that of 
B.subtilis. Searches of the protein databases for amino acid 
similarities were performed using BLAST2 sequence analysis 
tools (14) with subsequent comparison of protein coding 
sequences (CDSs) showing significant homology (>10 
significance) performed using the Lipman-Pearson algorithm 
( 15), Significant similarity was defined as at least 30% identity 
observed over 60% of the CDS, although those CDSs showing 
<30% identity over >60% of the protein were also included. A 
search for paralogous gene families, such as a factors, ATPases, 
antiporters and ATP-binding cassette (ABC) transporters, in the 
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Position (base pairs) 

Figure 2, Comparison of G+C profile between the genomes of B.halodurans 
and B.subtilis. Distribution of A+T-rich islands along the chromosome, in sliding 
windows of 10 000 nt, with a step of 5000 nt. Known phage (PBSX, SP|3 and 
skin) are indicated by their names and prophage-like elements in the B.subtilis 
genome are numbered from PI to P7. The A+T-rich or G+C-rich regions 
containing iransposases in the B.halodurans genome are also indicated from 
Tl to Tl ] . (A) B.subtilis. (B) B.halodurans. 



B.halodurans genome was performed with stepwise 
BLAST2P, identifying pairwise matches above P < lO'^-lO"^^ 
over 50% of the query search length, and subsequently by 
single linkage clustering of these matches into multigene 
families (16). 

RESULTS AND DISCUSSION 

Genome analysis 

The genome of B.halodurans is a single circular chromosome 
(17) consisting of 4 202 353 bp (Fig. 1) with an average G+C 
content of 43.7% (Table 1). The G+C content of DNA in the 
coding regions and non-coding regions is 44.4 and 39.8%, 
respectively. On the basis of analysis of the G+C ratio and G-C 
skew (G-C/G+C), we estimated that the site of termination of 
replication {terC) is nearly 2.2-2.3 Mb (193°) from the origin, 
but we could not find the gene encoding the replication 
termination protein {rtp) in the genome of B.halodurans. 
Several A+T-rich and G+C-rich islands are likely to reveal the 
signature of transposons or other inserted elements (Fig. 2). 
We identified 4066 CDSs (Fig. 1 and Supplementary Table SI 
available at NAR Online), on average 877 nt in size, using the 
coding region analysis program GeneHacker Plus (12) and the 
Genome Gambler system (10). We have not annotated CDSs 
that largely or entirely overlap existing genes. It was found that 
the termination codon in BH1054, annotated as a transposase. 
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Figure 3. Summary of conserved CDSs identified in the genome of B.halodurans C-125. (A) All CDSs; (B) CDSs with a known functional role; (C) CDSs of 
unknown function 



disappeared due to a frameshift and as a result this CDS was 
combined with the adjacent CDS, presumably coding for an 
ABC transporter/ATP-binding protein. We identify it as a gene 
coding for a transposase in this case. Coding sequences cover 
857c of the chromosome. We found that 78% of the genes 
started with ATG, 10% with TTG and 12% with GTG, as 
compared with 87, 13 and 9%, respectively, in the case of 
B.subtilis (Table 1). The average size of the predicted proteins 
in B.halodurans is 32.841 kDa, ranging from 1.188 to 199.106 
kDa. Predicted protein sequences were compared with 
sequences in a non-redundant protein database and biological 
roles were assigned to 2141 (52.7%) of them. In this database 
search 1 182 predicted coding sequences (29.1%) were identi- 
fied as conserved proteins of unknown function in comparison 
with proteins from other organisms, including B.subtilis, and 
for 743 (18.3%) there was no database match (Table 1). 
Among all of the CDSs found in the B.halodurans genome, 
2310 (56.8%) were widely conserved in other organisms, 
including B.subtilis, and 355 (8.7%) of the CDSs matched the 
sequences of proteins found only in B.subtilis (Fig. 3). The 
ratio of proteins conserved in various organisms, including 
B.subtilis, among functionally assigned CDSs (2141) and 
among the CDSs (1182) matched with hypothetical proteins 



from other organisms was 80.5 and 49.7%, respectively, as 
shown in Figure 3. Of 1 183 CDSs, 23.8% matched hypothetical 
proteins found only in the B.subtilis database, showing 
relatively high similarity values (Fig. 3). 

General features 

Bacillus halodurans C-125 is quite similar to B.subtilis in 
terms of genome size, G+C content of the genomic DNA and 
the physiological properties used for taxonomical identification, 
except for the alkaliphilic phenotype (4). Also, the phylo- 
genetic placement of C-125 based on 16S rDNA sequence 
analysis indicates that this organism is more closely related to 
B.subtilis than to other members of the genus Bacillus. 
Therefore, the question arises of how does the genome structure 
differ between two Bacillus strains which have similar properties 
except for alkaliphily. As a first step to answer this question, 
we analyzed the genome structure both at the level of the 
whole genomic sequence and at the level of orthologous 
proteins, comparing the B.halodurans and B.subtilis genomes 
continuously from the replication origin region (oriC). The 
dots in Figure 4A were plotted when more than 20 bases in the 
B.halodurans nucleotide sequence continuously matched those 
of B.subtilis in a sliding window 100 nt wide, with a step of 
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50 nt. Figure 4B shows the distribution of orthologous 
proteins, comparing B.halodurans and B.subtilis, and the dot 
patterns in these figures resemble each other. About 1500 
genes, some of which constitute operons, mainly categorized 
as genes associated with the following functions, are well 
conserved in the region common to B.halodurans and 
B.suhtilis: mobility and chemotaxis, protein secretion, cell 
division, the main glycolytic pathways, the TCA cycle, metabohsm 
of nucleotides and nucleic acids, metabolism of coenzymes 
and prosthetic groups, DNA replication, RNA modification, 
ribosomal proteins, aminoacyl-tRNA synthetases, protein 
folding, etc. On the other hand, the region around 1 12-153° in 
the B.halodurans genome corresponds to the region around 
212-240° in the B.subtilis genome, as suggested in a previous 
paper (17). 

Transposable elements 

One hundred and twelve CDSs in the B.halodurans genome 
showed significant similarity to the transposases or recombi- 
nases from various species, such as Anabena sp. Rhodobacter 
capsulatus, LmcWcoccus lac t is, Enterococcus faecium, 
Clostridium beijerinckii. Staphylococcus aureus and Yersinia 
pseudotuberculosis, indicating that these have played an 
important evolutionary role in horizontal gene transfer and also 
m internal rearrangement of the genome. These CDSs were 
categorized into 27 groups by similarity pattern and the genes 
are widely spread throughout the genome (Fig. 1 and Table SI). 
As shown in Figure 2, at least 11 A+T-rich and G+C-rich 
islands containing tranposases (Tl-Tll) occur in the 
B.halodurans genome. This is one of the notable features of 
this genome, as B.subtilis has only 10 transposons and trans- 
poson-related proteins. The G+C content of transposases 
varies from 37.4 to 49.2% and codon usage in transposases, 
especially for termination, is obviously different from other 
indigenous genes in B.halodurans. In other bacterial genomes 
It has been reported that Synecosystis sp. PC6803 (18), 
Escherichia coli MG1655 (19), Mycobacterium tuberculosis 



(20), Deinicoccus radiodurans (21) and Lactococcus lactis 
(22) contain many transposase genes, as well as B.halodurans 
C-125. 



Table 1. Comparison of the general features of the B.haioduram and 
B.suhtilis genomes 



Genome features 


B.halodurans 


B.subtilis^ 


Size (bp) 


4 202 353 


4 214 810 


G+C content (mol%) 






Total genome 


43.7 


43.5 


Coding region 


44.4 


44.3 


Non-coding region 


39.8 


39.3 


Open reading frames 






Percent of genome (coding) 


85 


87 


Predicted no. 


4066 


4100 


Conserved with function assigned 


2144 


2379 


Conserved with unknown function 


1182 


668 


Non-conserved 


743 


1053 


Percent AUG initiation codons 


78 


78 


Percent GUG initiation codons 


12 


9 


Percent UUG initiation codons 


10 


13 


Insertion elements 






PBSX prophage-related protein 


2 


27 


Transposase and related protein 


112 


10 


RNA elements 






Stable RNA (percent of genome) 


1.02 


1.27 


16S, 23S and5S rRNA 


8 


10 


tRNA 


78 


86 



"Kunsl etal. (5). 
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Figure 4. Structural analysis of the B.halodurans and B.subtilis genomes. (A) Distribution of regions having a nucleotide sequence common to the two Bacillus 
spp. The dots were plotted when more than 20 bases of the nucleotide sequence matched continuously with that of B.suhtilis in a sliding window 1(X) nt wide with 
a step of 50 nt. The columns appearing as a bar code clearly show the regions shared between the B.halodurans and B.subtilis genomes. (B) Distribution of 
orthologs between the two Bacillus spp. The dots were plotted at the positions where the gene for an orthologous protem exists when comparing the two genomes. 
The columns appearing as a bar code clearly show the positions of the genes for orthologous proteins in B.halodurans and B.subtilis. 
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On the other hand, we could not find any prophage which 
seemed to be active, although several phage-related proteins 
were identified in the B.halodurans genome. The B.halodurans 
genome contains no intact prophage, such as Spp, PBSX or 
skin, found in the B.subtilis genome (5), as shown in Figure 2. We 
confirmed that B.halodurans has the gene for <5^ as the complete 
form, which is divided into two parts, spoIVCB (N-terminal) and 
spollIC (C-terminal) by a prophage (skin element) in the 
B.subtilis genome. 

Origin of replication 

There are 14 CDSs in the oriC region of the chromosome of 
B.halodurans. The organization of the CDSs in the region is 
basically similar to those of other bacteria. The region from 
gidB to gyrA (BH4060-BH4066 and BH1-BH7), especially, 
was found to be the same as in B.subtilis. On the other hand, it 
was found that there are 10 CDSs (BH8-BH18), including 
three CDSs previously identified in the 13.3 kb of the oriC 
region (23), between gyrB and the rrnA operon, corresponding 
to the rrnO operon in the B.subtilis genome (Fig. 1 and 
Table SI), although there is no CDS between gyrA and rrnO in 
B.subtilis. Of these 10 CDSs, only one (BH8) was found to 
have a homolog in another organism, interestingly, not in the 
genus Bacillus; the others were unique to the B.halodurans 
genome (Table SI). 

Transcription and translation 

Genes encoding the three subunits (a, (3 and p') of the core 
RNA polymerase have been identified in B.subtilis along with 
the genes for 19 a factors (24). a factors belonging to the a"^^ 
family {a-^, c^, a^, o^, a*^, a" and a^) required for sporula- 
lion and are well conserved between B.halodurans and 
B.subtilis. Of 1 1 a factors identified in B.halodurans, 
belonging to the extracytoplasmic function (ECF) family, 
is also found in B.subtilis, but the other 10 (BH620, BH672, 
BH1615, BH2026, BH3117, BH3216, BH3223, BH3362, 
BH3632 and BH3882) are unique to B.halodurans. These 
unique a factors may have a role in the special physiological 
mechanisms by which B.halodurans is able to live in an 
alkaline environment, because it is well known that ECF a 
factors are present in a wide variety of bacteria and they serve 
to control the uptake or secretion of specific molecules or ions 
and to control responses to a variety of extracellular stress 
signals (25). 

Seventy-nine tRNA species, organized into 1 1 clusters 
involving 71 genes plus eight single genes, were identified 
(Fig. 1 and Table SI). Of the 1 1 clusters, six were organized in 
association with rRNA operons. Eight rRNA operons are 
present in the C- 1 25 genome and their organization is the same 
as that in B.subtilis (tRNA-16S-23S-5S, 16S-tRNA-23S-5S 
and 16S-23S-5S-tRNA). With respect to tRNA synthetases, 
the C-125 genome lacks the glutaminyl-tRNA synthetase gene 
ighiS), one of two threonyl-tRNA synthetase genes {thrZ) and 
one of two tyrosyl-tRNA synthetase genes [tsrS). The 
B.subtilis genome has all of these tRNA synthetase genes 
except for the glutaminyl-tRNA gene. It is likely that glutamyl- 
tRNA synthetase aminoacylates tRNA*^'" with glutamate 
followed by transamidation by Glu-tRNA amidotransferase 
(26) in both Bacillus species. 



Competence and sporulation 

Out of 20 genes related to competence in B.subtilis, 13 (cinA, 
comC, comEA, comEB, comEC, comER, comFA, comFC, 
comGA, comGB, comGC, comOD and mecA), mainly 
expressed in the late stage of competence, were identified in 
the B.halodurans genome, but we could not find any of the 
genes expressed in the early stage of competence. Among six 
genes whose products are known to serve as components of the 
DNA transport machinery, only three [comGB, comGC and 
comGD), but not the others well conserved in B.subtilis, were 
identified in B.halodurans C-125. Actually, competence has 
not been experimentally demonstrated in C-125 although we 
attempted to use standard (27) and modified methods, 
changing such conditions as pH, temperature and medium, for 
transformation. It has become clear that this is due to lack of 
some of the necessary genes, especially those expressed in the 
early stages, such as comS, srfA and rapC. Only 68 genes 
related to sporulaUon were identified in the C-125 genome, m 
contrast to 138 genes found in the B.subtilis genome. Although 
the minimum set of genes for sporulation was well conserved, 
as in the case of B.subtilis, the C-125 genome lacks some genes 
encoding key regulatory proteins (the response regulator for 
aspartate phosphatase and the phosphatase regulator) and the 
spore coat protein for sporulation conserved in the B.subtilis 
genome. The rap (rapA-rapK) and phr (phrA, phrC, phrE- 
phrG, phrl and phrK) genes especially were not found in the C- 
125 genome, suggesting that C-125 may have another type(s) 
of regulatory gene(s) for control of sporulation in a manner the 
same as or different from that in B.subtilis, as we ha\e 
observed sporulation in B.halodurans. 

Cell walls 

The peptidoglycan of alkaliphilic B.halodurans C-125 appears 
to be similar to that of neutrophilic B.subtilis. However, the 
cell wall components in C-125 are characterized by an excess 
of hexosamines and amino acids compared to that of B.subtilis. 
Glucosamine, muramic acid, D- and L-alanine, D-glutamic 
acid, /Tiei'o-diaminopimelic acid and acetic acid were found in 
cell wall hydrolysates (1). Although some variation was found 
in the amide content of the peptidoglycan isolated from alkaliphilic 
B.halodurans C-125, the pattern of variation was similar to 
that known to occur in B.subtilis. All genes related to pcpddo- 
glycan biosynthesis, such as mraY, murC-murG, cwlA, ddlA 
and glnA, confirmed to be present in the B.subtilis genome 
were also conserved in the C-125 genome (1). A bacitracin 
resistance gene found m the B.subtilis genome is duplicated m 
the C-125 genome (BH464 and BH1521). On the other hand, 
although the tagH and tagG genes were identified in B.halo- 
durans C-\25 (Fig. 1 and Table SI), 13 other genes for teichoic 
acid biosynthesis found in B.subtilis {dltA-dltE, ggcL\, ggaB, 
tagA-tagC, tagE, tag F and tagO) are missing from the B.halo- 
durans genome. Bacdlus halodurans also lacks six genes 
{tuaB-tuaF and tuaH) for teichuronic acid biosynthesis, all 
except tuaA and tuaG, in comparison with those of B.subtdis. 
In addition to peptidoglycan, the cell wall of alkaliphilic 
B.halodurans is known to contain certain acidic polymers, 
such as galacturonic acid, glutamic acid, aspartic acid and 
phosphoric acid. A teichuronopeptide (TUP) is present as a 
major structural component of the cell wall of C-125, which is 
a co-polymer of polyglutamic acid and polyglucuronic acid. 
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Thus, the negative charges on acidic non-peptidoglycan 
components may give the cell surface the ability to absorb 
sodium and hydronium ions and to repel hydroxide ions and, as 
a consequence, may contribute to allowing the cells to grow in 
alkaline environments. A mutant defective in TUP synthesis 
grows slowly at alkaline pH. The upper limit of pH for growth 
of the mutant is 10.4, whereas that of the parental C-125 strain 
IS 1 0.8. The tupA gene encoding TUP has been cloned from the 
C-125 chromosomal DNA (28). In this study, it has become 
clear that B.halodurans C-125 has no paralog of tupA in the 
genome and an ortholog of tupA cannot be found in the 
B.subtilis genome. 

Membrane transport and energy generation 

Bacillus halodurans C-125 requires Na"^ for growth under 
alkaline conditions. The presence of sodium ions in the 
surrounding environment has been proved to be essential for 
effective solute transport through the cytoplasmic membrane 
of C-125 cells. According to the chemi-osmotic theory, a 
proton-motive force is generated across the cytoplasmic 
membrane by the electron transport chain or by extrusion of 
derived from ATP metabolism through the action of ATPasc. 
We identified four types of ATPases (preprotein translocase 
subunit, class III heat shock ATP-dependent protease, heavy 
metal transporting ATPase and cation transporting ATPase). 
These ATPases are well conserved between B. halodurans and 
B.subtilis. 

Through a series of analyses such as a BLAST2 search, 
clustering analysis by the single linkage method examining all 
CDSs identified in the B. halodurans C-125 and B.subtilis 
genomes (8166 CDSs) and multiple alignment (16), 18 CDSs 
were grouped into the category of antiporter- and transporter- 
related protein genes in the C-125 genome. In this analysis it 
was found that five CDSs are candidates for Na+/H+ antiporter 
genes (BH1316, BH1319, BH2844, BH2964 and BH3946). 
However, we could not find any gene encoding antibiotic 
resistance proteins in the C-125 genome, whereas the B.subtilis 
genome has nine different ones. Eleven genes for multidrug 
resistance proteins were identified in the C-125 genome, six 
fewer than in B.subtilis, A non-alkaliphilic mutant strain 
(mutant 38154) derived from B. halodurans C-125 which is 
useful as a host for cloning genes related to alkaliphily has 
been isolated and characterized (29). A 3.7 kb DNA fragment 
(pALK fragment) from the parent strain restored growth of 
mutant 38 1 54 under alkaline pH conditions. This fragment was 
found to contain CDS BH1319, which is one of the Na+/H^ 
antiporter genes in B. halodurans. The transformant was able to 
maintain an intracellular pH lower than the external pH and the 
cells expressed an electrogenic NaVH"^ antiporter driven only 
by Ay (membrane potential, interior negative) (1,29). Bacillus 
subtilis has an ortholog (mprA) of BH1319 and it has been 
reported that a m/^M-deficient mutant of B.subtilis showed a 
sodium-sensitive phenotype (30). On the other hand, a mutant 
of strain C-125 with a mutation in BH1317 adjacent to 
BH1319 has been isolated and it showed an alkali-sensitive 
phenotype, although whether the Na'^/H"*" antiporter encoded by 
BH1317 is active in this mutant has not been confirmed 
experimentally yet. In addition, it has been reported that 
BH2819, the function of which is unknown and which is 
unique to the C-125 genome, is also related to the alkaliphilic 
phenotype (31). 



Bacillus halodurans C-125 has a respiratory electron trans- 
port chain and the basic gene set for it is conserved as 
compared with B.subtilis, but the gene for cytochrome bd 
oxidase (BH3775 and BH3776) is duplicated in the C-125 
genome. It is also clear that two genes for /?^?3-type cytochrome c 
oxidase (BH739 and BH740) not seen in B.subtilis are present 
in the C-125 genome. The C-125 genome has a F|Fo-ATP 
synthase operon (Fig. 1 and Table SI). The gene order in this 
operon (e subunit-p subunit-y subunit-a subunit-5 subunit- 
subunit b-subunit c-subunit a) is identical to that seen in 
B.subtilis. In addition to the FiFq-ATP synthase operon, the 
operon for a Na^-transporting ATP synthase and the operon for 
a flagellar-specific ATP synthase are also conserved between 
B. halodurans and B.subtilis. 

ABC transporters 

Members of the superfamily of ABC transport systems couple 
the hydrolysis of ATP to the translocation of solutes across a 
biological membrane (32). ABC transporter genes are the most 
frequent class of protein coding genes found in the B. halo- 
durans genome, as in the case of B.subtilis. They must be 
extremely important in Gram-positive bacteria such as 
Bacillus, because these bacteria have an envelope consisting of 
a single membrane. ABC transporters allow such bacteria to 
escape the toxic action of many comp<Dunds. Through the 
series of analyses described above, 75 genes coding for ABC 
transporter/ATP-binding proteins were identified in the 
B. halodurans genome. In this analysis 67 CDSs were grouped 
into the category of ATP-binding protein genes, although 71 
ATP-binding protein genes have been identified in the 
B.subtilis genome (5). We found that B. halodurans has eight 
more oligopeptide ATP-binding proteins, but four fewer amino 
acid ATP-binding proteins, as compared with B.subtilis. We 
could not find any other substantial difference between 
B. halodurans and B.subtilis in terms of the other ATP-binding 
proteins, although it should be noted that the specificity of 
some of these proteins is not known. The genes for oligo- 
peptide ATP-binding proteins (BH27, BH28, BH570, BH57I, 
BH1799, BH1800, BH2077, BH2078, BH3639, BH3640, 
BH3645, BH3646, AppD and AppF) are distributed 
throughout the C-125 genome. We speculate that these may 
contribute to survival under highly alkaline conditions, 
although there is no direct evidence to support this. On the 
other hand, 43 CDSs were identified as ABC transporter/ 
permeases in the B. halodurans genome. Surprisingly, B. halo- 
durans has only one amino acid permease, in contrast to the 1 2 
present in the B.subtilis genome. In addition, it is clear that 
B. halodurans lacks the sodium permease gene present in 
B.subtilis, whereas B.subtilis lacks the nickel permease gene 
present in B. halodurans. 

CONCLUSION 

Alkaliphilic B. halodurans is the second Bacillus species 
whose whole genomic sequence has been completely defined. 
The genomic sequence of B. halodurans offers a wealth of 
basic information regarding gene conservation and diversity in 
Bacillus spp. and systematic information that would be 
difficult, if not impossible, to obtain by any other approach. A 
more complete understanding of the biochemistry of this 
organism derived from genome analysis will provide the 
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foundation for clarification of the mechanisms of adaptation to 
extreme environments, especially to a highly alkaline environ- 
ment, as a first step. A new database specifically established for 
the B.haloduratis sequence, ExtremoBase, will be accessible 
through the World Wide Web server at http://www.jamstec.go.jp/ 
jamstec-e/bio/DEEPSTAR/FResearch.html . The sequence has 
been deposited in DDBJ/EMBL/GenBank with the accession nos 
APOOI507-AP001520. 

SUPPLEMENTARY MATERIAL 

Supplementary Material is available at NAR Online. 
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